Abstract

In this paper we study two basic facts of optimal transportation on Wiener space $W$.
Our first aim is to answer the Monge Problem on the Wiener space endowed with the
Sobolev type norm $\|\cdot\|_{k,\gamma}^p$ with $p \geq 1$ (the cases $p = 1$ and $p > 1$ are treated separately). The
second one is to prove $1$-convexity (resp. $C_{k,\gamma}^{-2}$-convexity) along (constant speed)
geodesics of relative entropy in $(\mathcal{P}_2(W), W_2)$, where $W$ is endowed with the infinite
norm (resp. with $\|\cdot\|_{k,\gamma}$), and $W_2$ is the 2-Wasserstein distance.

1 Introduction 

We are interested in two problems in optimal transportation on Wiener space. We
refer to a recent work of Ambrosio and Gigli [3] for a survey and basic tools of optimal
transport theory.

First we answer the Monge Problem on the Wiener space relatively to the cost
$\|\cdot\|_{k,\gamma}^p$ (introduced by Airault and Malliavin in [1]), defined as :

$$\|w\|_{k,\gamma} := \left( \int_0^1\!\!\int_0^1 \frac{|w(t) - w(s)|^{2k}}{|t-s|^{1+2k\gamma}}\,dt\,ds \right)^{1/2k}$$

for suitable parameters $k, \gamma$.

The Monge Problem has been widely studied in several settings for years. Nowadays
there are many methods to prove the existence of an optimal map solving this Problem,
which are summarized in [2].

Recently, there were considerable advances concerning the Monge Problem in $\mathbb{R}^n$. First
in 1996, Gangbo and McCann solved the Monge Problem in [15] when the cost is strictly
convex. Then people became interested in the case of different norms on $\mathbb{R}^n$. Indeed the
problem becomes more difficult since a norm is never strictly convex. In 2003, the Monge
Problem was solved when the cost is a crystalline norm by Ambrosio, Kirchheim
and Pratelli in [5]. When the cost is a general norm, the Monge Problem was solved
independently by Champion and De Pascale in [9] and by Caravenna in [7] in 2010. In all of
these latter cases we lose uniqueness of the optimal map.

In this paper, we turn our attention to the Wiener space, an infinite dimensional
space. The Monge Problem in the Wiener space was solved by Feyel and Ustunel
in [13] for the cost $|\cdot|_H^2$ induced by the Cameron-Martin norm, which is Hilbertian.
With this cost, we have uniqueness of the optimal map. The strategy of these authors was to pass
through finite dimensional approximations of the Wiener space : the aim being to reduce the
Monge Problem on Wiener space to finite dimensional spaces and to apply known
results. Then they used a selection theorem to go back up to the Wiener space. Then the
Monge Problem was solved by Cavalletti in [8] for the cost $|\cdot|_H$, again passing through finite
dimensional approximations.

We are interested in endowing the Wiener space $(W, H, \mu)$ (where $\mu$ is the Wiener
measure) with two other natural norms : the infinite norm $|\cdot|_\infty$ and the Sobolev type
norm $\|\cdot\|_{k,\gamma}$. For the first one, the Monge Problem is still open, and we can expect not to
have uniqueness of the optimal map, provided it exists. Here the norm considered,
$\|\cdot\|_{k,\gamma}$, is not Hilbertian and is weaker than the Cameron-Martin norm in the sense that for
some $C_{k,\gamma} > 0$ :

$$\|x\|_{k,\gamma} \leq C_{k,\gamma}\,|x|_H \quad \text{for all } x \in W. \qquad (1.1)$$

Let us emphasize that the right hand side is equal to infinity $\mu$-almost everywhere,
because the Cameron-Martin space has measure zero. Nevertheless our norm $\|\cdot\|_{k,\gamma}$
satisfies suitable conditions presented in section 2.1. The first aim of our paper is to solve
the Monge Problem for the cost $\|\cdot\|_{k,\gamma}^p$ :

$$\inf_{G_\#\rho_0 = \rho_1} \int_W \|G(x) - x\|_{k,\gamma}^p \, d\rho_0(x). \qquad (1.2)$$

In other words we will establish the following theorem :

Theorem 1.1 Let $\rho_0$ and $\rho_1$ be two measures on $W$ satisfying condition (1.3) below.

1. If $p > 1$ and $\rho_0$ is absolutely continuous with respect to $\mu$, then there exists a map
$T$, unique up to a set of zero measure for $\rho_0$, which minimizes (1.2). Moreover
there is a unique optimal transference plan between $\rho_0$ and $\rho_1$ relatively to the
cost $\|\cdot\|_{k,\gamma}^p$, which is exactly $(Id \times T)_\#\rho_0$.

2. If $p = 1$ and both $\rho_0$ and $\rho_1$ are absolutely continuous with respect to $\mu$, then
there exists an optimal transference plan $\Pi$ between $\rho_0$ and $\rho_1$ relatively to the
cost $\|\cdot\|_{k,\gamma}$, such that $\Pi$ is concentrated on a graph of some map $T$ minimizing (1.2).

The second purpose of this paper is to find a lower bound of Ricci curvature in the
sense of Sturm [19], for the Wiener space endowed with the infinite norm $|\cdot|_\infty$ or the
norm $\|\cdot\|_{k,\gamma}$. It is related to weak convexity of relative entropy along geodesics. More
precisely we will prove :

Theorem 1.2 If $\rho_0$ and $\rho_1$ are probability measures on $W$ both absolutely continuous
with respect to $\mu$, then there exists some (constant speed) geodesic $\rho_t$ induced by an
optimal transference plan between $\rho_0$ and $\rho_1$ such that :

$$\mathrm{Ent}_\mu(\rho_t) \leq (1-t)\,\mathrm{Ent}_\mu(\rho_0) + t\,\mathrm{Ent}_\mu(\rho_1) - \frac{K t(1-t)}{2}\, W_{2,c}^2(\rho_0,\rho_1) \quad \forall t \in [0,1],$$

where $W_{2,c}$ is the 2-Wasserstein distance (1.5) for the distance $c$ induced by the norm considered.

For the infinite norm, we will see that $K$ equals $1$, and for the Sobolev type
norm that $K$ equals $1/C_{k,\gamma}^2$. Note that Lott and Villani introduced in [17] a
notion stronger than weak $K$-convexity of relative entropy. Indeed in its definition, it is
required that the property above holds for all (constant speed) geodesics. So in many
cases weak $K$-convexity and $K$-convexity coincide, provided that the
geodesic between two given measures is unique (this is the case for non branching spaces). An
example where this fails is when the optimal coupling is not unique.

Now let us briefly summarize the following sections.

Throughout section 2, we solve the Monge Problem in the Wiener space with
the cost $\|\cdot\|_{k,\gamma}^p$ in different ways according to the parameter $p$. Among these, there is
a direct method : in general settings, the support of an optimal transference plan $\Pi$
(between two probability measures and relatively to a cost function $c$) is included in the
$c$-subdifferential of a $c$-convex function $\varphi$ (called Kantorovich potential). It leads
to the following system :

$$\begin{cases} \varphi^c(y) - \varphi(x) = c(x,y) & \Pi\text{-almost everywhere} \\ \varphi^c(y) - \varphi(x) \leq c(x,y) & \text{everywhere.} \end{cases}$$

This system can be solved directly when the cost $c$ and the potential $\varphi$ are differentiable,
so long as $\nabla_x c(x, \cdot)$ is injective, as it is explained in Villani's book [20]. This is
the case when $p > 1$. But this method fails when $p = 1$. In the latter case we turn to a
recent paper of Bianchini and Cavalletti [6] where the authors solve the Monge Problem
in non branching geodesic metric spaces. It turns out that the Wiener space endowed with
the norm $\|\cdot\|_{k,\gamma}$ is such a space. We will simply verify the suitable conditions.

Section 3 is devoted to establishing the $K$-convexity along geodesics of relative
entropy (w.r.t. the Wiener measure) on the Wiener space endowed with the infinite norm
($K = 1$), then with the Sobolev type norm $\|\cdot\|_{k,\gamma}$ ($K = 1/C_{k,\gamma}^2$). This time we will
proceed by finite dimensional approximation as Fang, Shao and Sturm in [12], who have
treated the case of the norm $|\cdot|_H$. This part requires the Wasserstein distance, which is
defined below. Our main contribution consists in establishing results without applying
powerful tools like Gromov-Hausdorff convergence (see [17]) or $\mathbb{D}$-convergence
introduced by Sturm in [19]. In the language of the latter authors, we can say that $(W, |\cdot|_\infty)$
is a $CD(1,\infty)$ space (it satisfies such a curvature-dimension condition) and $(W, \|\cdot\|_{k,\gamma})$
is a $CD(1/C_{k,\gamma}^2,\infty)$ space. Such conditions imply a lot of important results over
Wiener spaces. As consequences over the spaces $(W, |\cdot|_\infty)$ and $(W, \|\cdot\|_{k,\gamma})$, we can quote
Brunn-Minkowski, Bishop-Gromov or also Log-Sobolev inequalities (see [3]).

1.1 Settings

Our ambient space will be the classical Wiener space $(W,H,\mu) = (C_0([0,1],\mathbb{R}), H(\mathbb{R}), \mu)$
where $H(\mathbb{R}) := \{h : [0,1] \to \mathbb{R};\ h(t) = \int_0^t \dot h(s)\,ds \text{ and } \dot h \in L^2([0,1])\}$ and $\mu$ is the
Wiener measure. $H(\mathbb{R})$ is a Hilbert space with the inner product :

$$(h,g)_H := \int_0^1 \dot h(t)\,\dot g(t)\,dt.$$

Important facts are that $H$ is dense in $W$ with respect to the uniform norm, and
moreover $\mu(H) = 0$.

Given two Borel measures $\rho_0$ and $\rho_1$ on $W$, let us state a mass preservation
condition, necessary for all our discussion :

$$\int_W d\rho_0 = \int_W d\rho_1 < +\infty. \qquad (1.3)$$

In particular it is satisfied when $\rho_0$ and $\rho_1$ are both probability measures. When (1.3)
is satisfied, we will repeatedly consider the following Monge-Kantorovich Problem

$$\inf_{\Pi \in \Gamma(\rho_0,\rho_1)} \mathcal{I}(\Pi) = \inf_{\Pi \in \Gamma(\rho_0,\rho_1)} \int_{W\times W} \|x-y\|_{k,\gamma}^p \, d\Pi(x,y), \qquad (1.4)$$

which is a relaxation of the Monge Problem (1.2) stated above. As soon as
the cost $c$ is continuous and the ambient space is Polish, there always exists a
minimizer for the Monge-Kantorovich Problem (1.4) (see for example [4]). Such a
minimizer will be called an optimal coupling or optimal transference plan between $\rho_0$ and $\rho_1$.

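We endow the space $W$ with the following norm :

$$\|w\|_{k,\gamma} := \left( \int_0^1\!\!\int_0^1 \frac{|w(t) - w(s)|^{2k}}{|t-s|^{1+2k\gamma}}\,dt\,ds \right)^{1/2k},$$

where $0 < \gamma < 1/2$ and $2 < 1 + 2k\gamma < k$, with $k$ an integer. In fact this is
a pseudo-norm over $W$ since it can take an infinite value. For this reason, we consider
$\widetilde{W} := \{w \in W;\ \|w\|_{k,\gamma} < \infty\}$. It is well known that $\mu(\widetilde{W}) = 1$, hence to simplify
notation we will write $W = \widetilde{W}$. $(W, \|\cdot\|_{k,\gamma})$ is a separable Banach space, and all
measures considered in the sequel will be Borel with respect to the topology induced
by the norm $\|\cdot\|_{k,\gamma}$. $H$ is still dense in $(W, \|\cdot\|_{k,\gamma})$. We can write :

$$\|w\|_{k,\gamma} \leq C_{k,\gamma}\,|w|_H \quad \forall w \in H,$$

$$\text{where } C_{k,\gamma} := \left( \int_0^1\!\!\int_0^1 |t-s|^{k-1-2k\gamma}\,dt\,ds \right)^{1/2k}.$$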
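As a numerical sanity check of inequality (1.1), the following sketch evaluates $C_{k,\gamma}$ in closed form and approximates $\|h\|_{k,\gamma}$ by a midpoint rule for the path $h(t) = t$, whose Cameron-Martin norm is $1$. The parameter values $k = 4$, $\gamma = 0.25$, the grid size and all function names are illustrative assumptions, not taken from the paper; the closed form uses the elementary identity $\int_0^1\!\int_0^1 |t-s|^a\,dt\,ds = 2/((a+1)(a+2))$ for $a > -1$.

```python
import numpy as np

def double_integral_of_power(a):
    """Closed form of the integral of |t - s|**a over [0, 1]^2, valid for a > -1."""
    return 2.0 / ((a + 1.0) * (a + 2.0))

def embedding_constant(k, gamma):
    """C_{k,gamma} = (double integral of |t - s|^(k - 1 - 2 k gamma))^(1 / (2k))."""
    return double_integral_of_power(k - 1 - 2 * k * gamma) ** (1.0 / (2 * k))

def sobolev_norm(w, k, gamma, n=400):
    """Midpoint-rule approximation of ||w||_{k,gamma} for a vectorised path w."""
    t = (np.arange(n) + 0.5) / n
    values = w(t)
    num = np.abs(values[:, None] - values[None, :]) ** (2 * k)
    dist = np.abs(t[:, None] - t[None, :])
    np.fill_diagonal(dist, 1.0)  # the numerator vanishes on the diagonal
    integrand = num / dist ** (1 + 2 * k * gamma)
    return (integrand.sum() / n**2) ** (1.0 / (2 * k))

# Illustrative parameters: 0 < gamma < 1/2 and 2 < 1 + 2*k*gamma = 3 < k = 4.
k, gamma = 4, 0.25
C = embedding_constant(k, gamma)
norm_h = sobolev_norm(lambda t: t, k, gamma)  # h(t) = t has |h|_H = 1
```

For this path the exact value is $\|h\|_{k,\gamma}^{8} = \int\!\!\int |t-s|^5\,dt\,ds = 1/21$, while $C_{k,\gamma}^{8} = 1/3$, so the inequality holds with room to spare.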
Before continuing, let us explain how we can decompose the classical Wiener space
into finite dimensional spaces. Consider the projections $\pi_n : W \to W$ defined as :

$$\pi_n(w)(t) := w\!\left(\frac{k}{2^n}\right) + 2^n\left(t - \frac{k}{2^n}\right)\left( w\!\left(\frac{k+1}{2^n}\right) - w\!\left(\frac{k}{2^n}\right) \right) \quad \text{if } t \in \left[\frac{k}{2^n}, \frac{k+1}{2^n}\right].$$

To a continuous path, $\pi_n$ associates its piecewise affine part. Denote $V_n := \pi_n(W)$. We know
that $V_n$ is spanned by the Haar functions and therefore has dimension $2^n$; it will then
be identified with $\mathbb{R}^{2^n}$.

Denote by $V_n^\perp$ the subset of $W$ which is the image of $W$ by the map $I_W - \pi_n$. Since
$\pi_n$ is a projection, one can write $W = V_n \oplus V_n^\perp$.

An important fact is the following : the image measure of the Wiener measure $\mu$
by $\pi_n$ is the standard Gaussian measure on $V_n$. In other words $(\pi_n)_\#\mu = \gamma_d$, where
the index $d$ is the dimension of $V_n$, i.e. $2^n = d$. For the sake of simplicity, we will denote
$(\pi_n)_\#\mu = \gamma_n$ instead of $\gamma_d$.
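The projection $\pi_n$ is simply linear interpolation between the dyadic nodes $k/2^n$, so it can be sketched numerically as follows. The sample path, the level $n = 3$ and the function name `project_dyadic` are illustrative assumptions; the check is that applying the map twice changes nothing, as expected of a projection.

```python
import numpy as np

def project_dyadic(w, n, t):
    """Evaluate pi_n(w) at times t: linear interpolation of w between the nodes k / 2**n."""
    nodes = np.linspace(0.0, 1.0, 2**n + 1)
    return np.interp(t, nodes, w(nodes))

w = lambda s: np.sin(3.0 * s) * s      # a sample path with w(0) = 0 (illustrative choice)
t = np.linspace(0.0, 1.0, 1001)
n = 3

once = project_dyadic(w, n, t)
twice = project_dyadic(lambda s: project_dyadic(w, n, s), n, t)
max_gap = float(np.max(np.abs(once - twice)))   # size of pi_n(pi_n(w)) - pi_n(w)
```

Since $w(0) = 0$, the interpolant is determined by its values at the $2^n$ nodes $k/2^n$ with $k \geq 1$, which matches the dimension of $V_n$.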

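Recall that the Sobolev space over the Wiener space $\mathbb{D}_1^p(W)$ is the set of all $\mu$-measurable
functions $F \in L^p(W,\mu)$ such that there exists $\nabla F \in L^p(W, H)$ for which we have $\mu$-almost
surely :

$$(\nabla F(w), h)_H = D_hF(w) := \lim_{\varepsilon \to 0} \frac{F(w + \varepsilon h) - F(w)}{\varepsilon} \quad \text{in } L^p(W,\mu) \quad \forall h \in H.$$

We will denote

$$\mathbb{D}_1^\infty(W) := \bigcap_{p > 0} \mathbb{D}_1^p(W) \quad \text{and} \quad E(F) := E_\mu(F).$$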

In section 3 we will need the 2-Wasserstein distance between two probability measures
$\rho_0$ and $\rho_1$ on a measurable space $W$, defined as :

$$W_{2,c}^2(\rho_0,\rho_1) := \inf_{\Pi \in \Gamma(\rho_0,\rho_1)} \int_{W\times W} c(x,y)^2\, d\Pi(x,y) \qquad (1.5)$$

where $\Gamma(\rho_0,\rho_1)$ is the set of coupling measures between $\rho_0$ and $\rho_1$, i.e. the measures on
$W \times W$ with marginals $\rho_0$ and $\rho_1$. Moreover we will denote by $\Gamma_0(\rho_0,\rho_1)$ the set of
optimal couplings for $c^2$, or equivalently the set of couplings which realize the minimum in the
2-Wasserstein distance induced by the cost $c$. A sufficient (but not natural) condition
for the Wasserstein distance to be finite is that $\rho_0$ and $\rho_1$ have finite second moments. To
justify the terminology, notice that $W_{2,c}$ is indeed a distance so long as $c$ is a distance on $W$,
and in this case the space

$$\mathcal{P}_2(W) := \left\{ \rho \text{ probability measure on } W \ \Big|\ \int_W c(x,x_0)^2\, d\rho(x) < \infty \text{ for some } x_0 \in W \right\}$$

endowed with $W_{2,c}$ is a metric space.
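To make definition (1.5) concrete, here is a small sketch for two uniform measures carried by the same number of atoms on the real line. It relies on two standard facts that are not stated in the paper: for such measures the infimum in (1.5) is attained at a coupling induced by a permutation, and for $c(x,y) = |x-y|$ on $\mathbb{R}$ the monotone (sorted) matching is optimal. The data and function names are illustrative assumptions.

```python
import itertools
import numpy as np

def w2_bruteforce(xs, ys, cost):
    """W_{2,c} between uniform measures on xs and ys, minimising over permutation couplings."""
    n = len(xs)
    best = min(
        sum(cost(xs[i], ys[p[i]]) ** 2 for i in range(n)) / n
        for p in itertools.permutations(range(n))
    )
    return best ** 0.5

def w2_sorted_1d(xs, ys):
    """On the real line with c(x, y) = |x - y|, use the monotone (sorted) coupling."""
    return float(np.sqrt(np.mean((np.sort(xs) - np.sort(ys)) ** 2)))

xs = np.array([0.0, 2.0, 5.0, 1.0])
ys = np.array([3.0, 1.0, 4.0, 6.0])
d_brute = w2_bruteforce(xs, ys, lambda x, y: abs(x - y))
d_sorted = w2_sorted_1d(xs, ys)
```

Here the sorted atoms are $(0,1,2,5)$ and $(1,3,4,6)$, so both computations give $W_{2,c}^2 = (1+4+4+1)/4 = 2.5$. The brute-force search is only practical for a handful of atoms.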

Throughout this paper, $Pr_i$ (with $1 \leq i \leq N$) stands for the projection onto the $i$-th
component :

$$Pr_i : X_1 \times X_2 \times \dots \times X_N \to X_i,$$
where $N \in \mathbb{N}$ depends on the context.
2 Monge Problem on Wiener space with $\|\cdot\|_{k,\gamma}^p$

We can now focus on the Monge Problem. Notice that $(W, \|\cdot\|_{k,\gamma})$ is a Polish space
and in addition, the cost $c(x,y) = \|x-y\|_{k,\gamma}^p$ being continuous (for all $p \geq 1$), there
always exists a measure $\Pi \in \Gamma(\rho_0,\rho_1)$ which attains the minimum in the
Monge-Kantorovich Problem (1.4).

For $\rho_0$, $\rho_1$ two measures on $W$ satisfying (1.3), recall that the Monge Problem
between $\rho_0$ and $\rho_1$ consists of finding an optimal map $T : W \to W$ which pushes $\rho_0$
forward to $\rho_1$ and minimizes the quantity :

$$\int_W c(x, G(x))\,d\rho_0(x),$$

among all push-forward maps $G$ (i.e. $G_\#\rho_0(E) := \rho_0(G^{-1}(E)) = \rho_1(E)$ for all Borel
subsets $E$ of $W$).

The usual strategy is to use the characterization of optimal couplings with the help of
Kantorovich potentials. Hence we need the concept of $c$-convexity.

Definition 2.1 Let $\varphi : W \to \mathbb{R}$. We say that $\varphi$ is $c$-convex if :

$$\varphi(x) = \sup_{y \in W} \left( \varphi^c(y) - c(x,y) \right) \quad \forall x \in W,$$

where $\varphi^c$, called the $c$-transform of $\varphi$, is defined as :

$$\varphi^c(y) = \inf_{x \in W} \left( \varphi(x) + c(x,y) \right) \quad \forall y \in W.$$

Notice that our cost $c$ does not take an infinite value, so $c$-convex functions are
indeed real-valued.
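Definition 2.1 can be explored on a finite grid, where the infimum and supremum become a minimum and a maximum over grid points. The sketch below is a discrete stand-in, not the infinite dimensional setting of the paper: the grid, the exponent $p = 2$, the test function and the function names are all illustrative assumptions. It checks the standard facts that $\psi(x) := \max_y(\varphi^c(y) - c(x,y))$ lies below $\varphi$, has the same $c$-transform, and is itself $c$-convex.

```python
import numpy as np

def c_transform(phi, cost):
    """phi^c(y_j) = min over i of (phi(x_i) + c(x_i, y_j)); cost[i, j] = c(x_i, y_j)."""
    return np.min(phi[:, None] + cost, axis=0)

def c_convex_hull(phi, cost):
    """x_i -> max over j of (phi^c(y_j) - c(x_i, y_j)); phi is c-convex iff this returns phi."""
    return np.max(c_transform(phi, cost)[None, :] - cost, axis=1)

grid = np.linspace(-1.0, 1.0, 41)
p = 2.0                                            # illustrative exponent, p > 1
cost = np.abs(grid[:, None] - grid[None, :]) ** p
phi = np.cos(4.0 * grid)                           # an arbitrary test function on the grid
psi = c_convex_hull(phi, cost)                     # largest c-convex function below phi
```

Applying `c_convex_hull` to `psi` again returns `psi`, which is the discrete form of the fixed-point identity defining $c$-convexity.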

It is well known that any optimal coupling $\Pi$ is $c$-cyclically monotone, i.e. its support
(defined as the smallest closed subset of $W \times W$ having full $\Pi$-measure) is $c$-cyclically
monotone, namely : for all $N \in \mathbb{N}$ and $(x_1,y_1), \dots, (x_N,y_N) \in \mathrm{Supp}(\Pi)$ we have :

$$\sum_{i=1}^N c(x_i,y_i) \leq \sum_{i=1}^N c(x_i,y_{i+1}),$$

with $y_{N+1} := y_1$.

In fact there is equivalence (in our case) between optimality and $c$-cyclical
monotonicity (see e.g. [3]). In particular if $\mathrm{Supp}(\Pi)$ is $c$-cyclically monotone then any
coupling $\Pi'$ such that $\mathrm{Supp}(\Pi') \subset \mathrm{Supp}(\Pi)$ is also $c$-cyclically monotone.
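The cyclical monotonicity inequality can be tested directly on a finite support. The sketch below checks it over all cycles of distinct support points (a partial check, since the definition also allows repeated points), for a matching that is optimal among permutations and for a deliberately crossed one. The cost $|x-y|^{1.5}$, the data and the function name are illustrative assumptions.

```python
import itertools

def is_cyclically_monotone(pairs, cost, tol=1e-12):
    """Check sum c(x_i, y_i) <= sum c(x_i, y_{i+1}) over all cycles of distinct pairs."""
    for size in range(2, len(pairs) + 1):
        for cycle in itertools.permutations(pairs, size):
            direct = sum(cost(x, y) for x, y in cycle)
            shifted = sum(cost(cycle[i][0], cycle[(i + 1) % size][1]) for i in range(size))
            if direct > shifted + tol:
                return False
    return True

cost = lambda x, y: abs(x - y) ** 1.5          # illustrative cost |x - y|^p with p = 1.5
xs = [0.0, 2.0, 5.0, 1.0]
ys = [3.0, 1.0, 4.0, 6.0]

# Matching that minimises the total cost among all permutations of the targets.
best = min(itertools.permutations(range(4)),
           key=lambda p: sum(cost(xs[i], ys[p[i]]) for i in range(4)))
optimal_pairs = [(xs[i], ys[best[i]]) for i in range(4)]
crossed_pairs = [(0.0, 6.0), (5.0, 1.0)]       # a deliberately bad matching
```

The crossed matching fails already for the cycle of length two: its direct cost is $6^{1.5} + 4^{1.5}$, while swapping the targets costs only $1 + 1$.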

Let us make precise the difficulties we have met.

• When $p = 1$, $c(x,y) = \|x-y\|_{k,\gamma}$ is the considered norm on $W$. Hence if a map
$\varphi$ is $c$-convex then it is also $1$-Lipschitz, hence $H$-Lipschitz. Indeed :

$$|\varphi(x+h) - \varphi(x)| \leq \|h\|_{k,\gamma} \leq C_{k,\gamma}|h|_H \quad \forall h \in H \ \ \forall x \in W.$$

In this case, we have a version of Rademacher theorem (proved in [10] and recalled
in the appendix for the sake of completeness) on Wiener space, for such $H$-Lipschitz
functions. With the remark above, it follows that $c$-convex functions on the Wiener
space are almost surely differentiable. But the difficulty in this case is that
the cost, being a norm, is not strictly convex, so we do not have injectivity of
$y \mapsto \nabla_x c(x,y)$, therefore our method is not available. Nevertheless we shall
use another method to solve the Monge Problem. Indeed thanks to part 2. of
Lemma 2.1, $(W, \|\cdot\|_{k,\gamma})$ is a geodesic non branching space (i.e. geodesics cannot
bifurcate). So we can apply the method detailed in [6].

• When $p > 1$, $c(x,y) = \|x-y\|_{k,\gamma}^p$ becomes strictly convex, so we get this time
the injectivity of $\nabla_x c(x, \cdot)$. But we lose the $H$-Lipschitz property of $c$-convex
functions. Indeed if $\varphi$ is such a function we can write :

$$|\varphi(x) - \varphi(y)| \leq \left|\, \|x - z\|_{k,\gamma}^p - \|y - z\|_{k,\gamma}^p \right| \leq \|x - y\|_{k,\gamma}\, M_z,$$

where the latter constant depends on $z$ and cannot be bounded. However
we will see that in this case $c$-convex functions (hence potentials) are locally
$H$-Lipschitz. Since differentiability is a local property, we should be able to
apply Rademacher theorem again.

Before we continue, let us state properties of the considered norm, which will be useful
for the sequel.

2.1 Properties of $\|\cdot\|_{k,\gamma}$

We give two ingredients that will be essential for the sequel.

Lemma 2.1 If we denote by $F : W \to \mathbb{R}_+$ the map $F(w) = \|w\|_{k,\gamma}$, then we have
the following properties :

1. $F$ admits a gradient $\nabla F(w)$ belonging to $W^*$ for all $w \in W \setminus \{0\}$, where $W^*$ is
the dual of $W$. Moreover $F^p$ is everywhere differentiable for all $p > 1$.

2. $F$ is a norm such that its unit ball is strictly convex.

The first part of the proof is inspired from 
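Proof :

1. First we show the property for $\tilde F := F^{2k}$. Take $h \in W$; we can write for $w \in W$ and
$\varepsilon > 0$ :

$$\tilde F(w + \varepsilon h) = \int_0^1\!\!\int_0^1 \frac{|(w+\varepsilon h)(t) - (w+\varepsilon h)(s)|^{2k}}{|t-s|^{1+2k\gamma}}\,dt\,ds.$$

Taking the derivative at $\varepsilon = 0$, it is clear that $\lim_{\varepsilon \to 0} \frac{\tilde F(w+\varepsilon h) - \tilde F(w)}{\varepsilon}$ exists and
moreover :

$$|D_h\tilde F(w)| \leq 2k \int_0^1\!\!\int_0^1 \frac{|w(t)-w(s)|^{2k-1}\,|h(t)-h(s)|}{|t-s|^{1+2k\gamma}}\,dt\,ds = 2k \int_0^1\!\!\int_0^1 \frac{|w(t)-w(s)|^{2k-1}}{|t-s|^{(1+2k\gamma)(2k-1)/(2k)}} \cdot \frac{|h(t)-h(s)|}{|t-s|^{(1+2k\gamma)/(2k)}}\,dt\,ds,$$

and now applying Hölder's inequality, we get :

$$|D_h\tilde F(w)| \leq 2k \left( \int_0^1\!\!\int_0^1 \frac{|w(t)-w(s)|^{2k}}{|t-s|^{1+2k\gamma}}\,dt\,ds \right)^{(2k-1)/(2k)} \left( \int_0^1\!\!\int_0^1 \frac{|h(t)-h(s)|^{2k}}{|t-s|^{1+2k\gamma}}\,dt\,ds \right)^{1/(2k)} = 2k\,\|w\|_{k,\gamma}^{2k-1}\,\|h\|_{k,\gamma}.$$

Hence $h \mapsto D_h\tilde F(w)$ is a bounded operator on $W$ for all $w \in W$. It leads to the existence
of a gradient $\nabla \tilde F(w)$ which belongs to the dual space $W^* \subset H^* = H$ (by (1.1)). Since
$F = \tilde F^{1/(2k)}$, its gradient satisfies $\nabla F(w) = \frac{1}{2k}\tilde F^{1/(2k) - 1}(w)\,\nabla\tilde F(w)$, and in particular $w$
must not be equal to zero.

Since $F$ is differentiable away from $\{0\}$, it is easy to see that for any $p > 1$, $F^p$
is differentiable everywhere over $(W, \|\cdot\|_{k,\gamma})$.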

2. This proof is the same as the proof of Minkowski's inequality. Indeed if $w_1, w_2 \in W$
and $\eta \in (0,1)$ then, writing $\Delta_i := w_i(t) - w_i(s)$ for $i = 1, 2$, we have :

$$\|(1-\eta)w_1 + \eta w_2\|_{k,\gamma}^{2k} = \int_{[0,1]^2} \frac{|(1-\eta)\Delta_1 + \eta\Delta_2|^{2k}}{|t-s|^{1+2k\gamma}}\,dt\,ds$$

$$\leq \int_{[0,1]^2} \frac{(1-\eta)|\Delta_1|}{|t-s|^{(1+2k\gamma)/(2k)}} \cdot \frac{|(1-\eta)\Delta_1 + \eta\Delta_2|^{2k-1}}{|t-s|^{(1+2k\gamma)(2k-1)/(2k)}}\,dt\,ds + \int_{[0,1]^2} \frac{\eta|\Delta_2|}{|t-s|^{(1+2k\gamma)/(2k)}} \cdot \frac{|(1-\eta)\Delta_1 + \eta\Delta_2|^{2k-1}}{|t-s|^{(1+2k\gamma)(2k-1)/(2k)}}\,dt\,ds$$

$$\leq \left( (1-\eta)\|w_1\|_{k,\gamma} + \eta\|w_2\|_{k,\gamma} \right) \|(1-\eta)w_1 + \eta w_2\|_{k,\gamma}^{2k-1}.$$

The two inequalities above are respectively the triangle inequality and Hölder's inequality,
and are in fact equalities if and only if $w_1$ and $w_2$ are almost everywhere colinear. This
leads to the strict convexity of our norm. ■

Conditions on parameters $(p, \gamma)$ are sufficient to have $F \in \mathbb{D}_1^\infty(W)$, as it is shown
in [11]. Notice that the (Gateaux) differentiability of $F$ holds in the direction of $W$,
hence in particular in the direction of $H$.

2.2 The case p > 1.

Throughout this subsection, the cost is $c(x,y) = \|x-y\|_{k,\gamma}^p$, with $p > 1$.

We follow Fathi and Figalli in [13] to get around the fact that $c$-convex functions
are not $1$-Lipschitz with respect to $\|\cdot\|_{k,\gamma}$, but nevertheless are locally Lipschitz when
restricted to suitable subsets. The key argument is that the sup of a family of uniformly
$\|\cdot\|_{k,\gamma}$-Lipschitz functions is also Lipschitz.

Theorem 2.1 Let $\rho_0$ and $\rho_1$ be two measures on $W$ satisfying (1.3) and such that
the first one is absolutely continuous with respect to the Wiener measure $\mu$. Assume
$\mathcal{I}(\Pi)$ is finite for some $\Pi \in \Gamma(\rho_0,\rho_1)$.

Then there exists a unique optimal coupling between $\rho_0$ and $\rho_1$ relatively to the cost
$c$. Moreover it is concentrated on a graph of some Borel map $T : W \to W$, unique up
to a set of zero measure for $\rho_0$.

Proof :

Let $\Pi \in \Gamma(\rho_0,\rho_1)$ be an optimal coupling for $c$. We shall show that $\Pi$ is concentrated on
a graph of some Borel map. It is well known (see e.g. [20]) that under the condition that $\mathcal{I}(\Pi)$ is
finite, since $\mathrm{Supp}(\Pi)$ is $c$-cyclically monotone, there is a $c$-convex map $\varphi : W \to \mathbb{R}$
(called Kantorovich potential) such that

$$\varphi^c(y) - \varphi(x) = \|x-y\|_{k,\gamma}^p \quad \Pi\text{-a.s.}$$

Moreover from the definition of $c$-convexity, we also have

$$\varphi^c(y) - \varphi(x) \leq \|x-y\|_{k,\gamma}^p \quad \forall (x,y) \in W \times W. \qquad (2.6)$$

Since $\varphi^c$ is finite everywhere, if we consider the subsets $W_n := \{\varphi^c \leq n\}$ for $n \in \mathbb{N}$ then :

$$W_n \subset W_{n+1} \quad \text{and} \quad \bigcup_n W_n = W.$$

Our cost $c(\cdot,y) = \|\cdot - y\|_{k,\gamma}^p$ is locally Lipschitz, locally uniformly in $y$. Hence
for each $y \in W$ there exists a neighborhood $E_y$ of $y$ such that $(\|\cdot - z\|_{k,\gamma}^p)_{z \in E_y}$ is a
uniform family of locally $\|\cdot\|_{k,\gamma}$-Lipschitz functions. Moreover, $W$ being separable, we
can find a sequence $(y_l)_{l \in \mathbb{N}}$ of elements of $W$ such that :

$$\bigcup_{l \in \mathbb{N}} E_{y_l} = W.$$

Now consider the increasing subsets of $W$ :

$$V_n := W_n \cap \Big( \bigcup_{l=1}^n E_{y_l} \Big).$$

We can define maps approximating $\varphi$ as follows :

$$\varphi_n : W \to \mathbb{R}, \qquad x \mapsto \sup_{y \in V_n} \left( \varphi^c(y) - \|x-y\|_{k,\gamma}^p \right).$$

Notice that

$$\varphi_n(x) = \max_{1 \leq l \leq n}\ \sup_{y \in W_n \cap E_{y_l}} \left( \varphi^c(y) - \|x-y\|_{k,\gamma}^p \right).$$

But since $\varphi^c \leq n$ on $W_n$ and $-\|\cdot\|_{k,\gamma}^p$ is bounded from above, $\varphi_n$ is also bounded
from above. Therefore the family $(\varphi^c(y) - \|\cdot - y\|_{k,\gamma}^p)_{y \in W_n \cap E_{y_l}}$ is uniformly locally
$\|\cdot\|_{k,\gamma}$-Lipschitz and bounded from above. Finally a proposition in the Appendix shows
that $\varphi_n$, being a maximum of uniformly locally $\|\cdot\|_{k,\gamma}$-Lipschitz functions, is also locally
$\|\cdot\|_{k,\gamma}$-Lipschitz. We can extend $\varphi_n$ to a Lipschitz function everywhere on $W$,
still denoted by $\varphi_n$. By (1.1) we get :

$$|\varphi_n(w+h) - \varphi_n(w)| \leq C\|h\|_{k,\gamma} \leq C\,C_{k,\gamma}|h|_H \quad \forall w \in W,\ \forall h \in H.$$

Namely, $\varphi_n$ is an $H$-Lipschitz function. Thanks to Rademacher theorem, there exists a
Borel subset $F_n$ of $W$ with full $\mu$- (hence $\rho_0$-) measure such that for all $x \in F_n$, $\varphi_n$ is
differentiable at $x$. Then for each $x \in F := \bigcap_n F_n$ (which also has full $\rho_0$-measure),
each $\varphi_n$ is differentiable at $x$.

Since $(V_n)_n$ is increasing, it is clear that $\varphi_n \leq \varphi_{n+1} \leq \varphi$ everywhere on $W$. Moreover,
with the same argument as in [13], if $P_n := Pr_1(\mathrm{Supp}(\Pi) \cap (W \times V_n))$ then $\varphi|_{P_n} =
\varphi_n|_{P_n} = \varphi_l|_{P_n}$ for all $l \geq n$ and all $n \in \mathbb{N}$. Fix $x \in P_n \cap F$. By definition of $P_n$ there exists
$y_x \in V_n$ such that :

$$(x, y_x) \in \mathrm{Supp}(\Pi), \quad \text{i.e.} \quad \varphi^c(y_x) - \varphi(x) = \|x - y_x\|_{k,\gamma}^p.$$

Subtracting (2.6) taken at $(x', y_x)$ from the previous equality, we get for all $x' \in W$ :

$$\varphi(x') - \varphi(x) \geq \|x - y_x\|_{k,\gamma}^p - \|x' - y_x\|_{k,\gamma}^p.$$

Taking $x' = x + \varepsilon h$ with $\varepsilon > 0$, $h \in H$, dividing by $\varepsilon$ and taking the limit when $\varepsilon$ tends
to $0$, we get (by linearity in $h$) :

$$\nabla\varphi(x) + \nabla_x c(x, y_x) = 0. \qquad (2.7)$$

Indeed $c(\cdot, y_x)$ is differentiable at $x$ thanks to Lemma 2.1. The strict convexity of
$c(x,y) = \|x-y\|_{k,\gamma}^p$ yields that $\nabla_x c(x, \cdot)$ is injective, and (2.7) gives :

$$y_x = (\nabla_x c(x, \cdot))^{-1}(-\nabla\varphi(x)) =: T(x),$$

where $(\nabla_x c(x, \cdot))^{-1}$ is the inverse of the map $y \mapsto \nabla_x c(x,y)$. Notice here that $T$ is
uniquely determined. We deduce that $\mathrm{Supp}(\Pi) \cap (W \times V_n)$ is the graph of the map $T$
over $P_n \cap F$ for all $n \in \mathbb{N}$. But $(P_n)_n$ and $(V_n)_n$ are increasing and such that $\bigcup_n V_n = W$.
Therefore $\mathrm{Supp}(\Pi)$ is a graph over $Pr_1(\mathrm{Supp}(\Pi)) \cap F$ with $Pr_1(\mathrm{Supp}(\Pi)) = \bigcup_n P_n$.

We can extend $T$ to a measurable map over $W$ as it is explained in [13]. We
obtain that $\mathrm{Supp}(\Pi)$ is included in the graph of a measurable map $T$, unique up to a set of
zero $\rho_0$-measure. In other words $\Pi = (Id \times T)_\#\rho_0$.

We have proved that any optimal coupling is carried by a graph of some map. Now
if $\Pi_1, \Pi_2 \in \Gamma(\rho_0,\rho_1)$ are optimal for $c$, then any convex combination of $\Pi_1$ and $\Pi_2$
is also optimal. Take $\Pi := \frac{1}{2}(\Pi_1 + \Pi_2)$, which is an optimal coupling between $\rho_0$ and $\rho_1$ :
there exists some measurable map $T$ such that $\Pi = (Id \times T)_\#\rho_0$. Let $f$ be the density
of $\Pi_1$ with respect to $\Pi$. Then for any continuous bounded function $\phi$ we have :

$$\int_W \phi(x)\,d\rho_0(x) = \int_{W\times W} \phi(x)\,d\Pi_1(x,y) = \int_{W\times W} \phi(x) f(x,y)\,d\Pi(x,y) = \int_W \phi(x) f(x,T(x))\,d\rho_0(x).$$

This yields $f(x,T(x)) = 1$ $\rho_0$-a.e., hence $f = 1$ $\Pi$-a.e. It leads to $\Pi = \Pi_1$ and finally
$\Pi_2 = \Pi_1 = (Id \times T)_\#\rho_0$. ■



2.3 The case p = 1.

Throughout this subsection, the cost is $c(x,y) = \|x-y\|_{k,\gamma}$.

We follow the method of [5] developed by Bianchini and Cavalletti in [6]. For this
we will need the notions of transport rays, transport sets and endpoints. By strict convexity
of our norm $\|\cdot\|_{k,\gamma}$, $(W, \|\cdot\|_{k,\gamma})$ is a geodesic non branching space.

Let us briefly recap the different steps of their method :

1. reduce the initial Monge-Kantorovich Problem to the (one-dimensional)
Monge-Kantorovich Problem along distinct geodesics (this is possible since the space is
non branching).

2. verify that the conditional measures provided by disintegration of both measures
$\rho_0$ and $\rho_1$ on each geodesic are without atoms (this is possible thanks to properties
of the Gaussian measure) in order to apply the result on the one-dimensional Monge Problem
: on each geodesic there exists a transport map.

3. piece the obtained maps together to get a transport map for the initial Monge
Problem (this is possible by a general selection theorem).

We present only sketches of the results, which are very similar to those for the Cameron-Martin
norm $|\cdot|_H$. For all details, consult [8] and [6]. In our case, the cost $\|\cdot\|_{k,\gamma}$ is smooth
enough to guarantee the existence of a potential $\varphi$ such that for every optimal coupling
$\Pi$ we have :

$$\mathrm{Supp}(\Pi) \subset \Gamma := \{(x,y) \in W \times W;\ \varphi^c(y) - \varphi(x) = \|x-y\|_{k,\gamma}\}.$$

Definition 2.2 The set of oriented transport rays is defined as :

$$G := \{(x,y) \in W \times W;\ \exists (w,z) \in \Gamma;\ \|w-x\|_{k,\gamma} + \|x-y\|_{k,\gamma} + \|y-z\|_{k,\gamma} = \|w-z\|_{k,\gamma}\}.$$

We denote by $G(x)$ the outgoing transport rays from $x \in W$ and by $G^{-1}(x)$ the
incoming transport rays in $x \in W$. Finally define the set of transport rays as :

$$R := G \cup G^{-1}.$$
Definition 2.3 The transport set (with endpoints) is defined as :

$$\mathcal{T}_e := \{x \in W;\ R(x) \neq \{x\}\}.$$
The transport set (without endpoints) is defined as :

$$\mathcal{T} := \{x \in W;\ G(x) \neq \{x\},\ G^{-1}(x) \neq \{x\}\}.$$

Here the important fact is the following : the space $(W, \|\cdot\|_{k,\gamma})$ is non branching,
hence the set $R(x)$ is a single geodesic for each $x \in \mathcal{T}$. It follows that $R \cap (\mathcal{T} \times \mathcal{T})$
is an equivalence relation on $\mathcal{T}$. Equivalence classes form a partition of $\mathcal{T}$ and therefore
we can apply a theorem of disintegration of measures, as it is explained in [8], provided
there is a $\rho_0$-measurable cross section $f : \mathcal{T} \to \mathcal{T}$ for the ray equivalence relation $R$
(see [8] for the terminology).

It is possible to follow part 4. of [3] to prove the existence of such a cross
section $f$, since geodesics of $(W, \|\cdot\|_{k,\gamma})$ are continuous and locally compact.

Definition 2.4 The multivalued endpoint graphs are defined as :

$$a := \{(x,y) \in G^{-1};\ G^{-1}(y) \setminus \{y\} = \emptyset\},$$

$$b := \{(x,y) \in G;\ G(y) \setminus \{y\} = \emptyset\}.$$

Then $Pr_2(a)$ and $Pr_2(b)$ are respectively called initial points and final points.

Since many geodesics can start from a point of $a(\mathcal{T})$, Proposition 2.3 is necessary
to see that such a set in fact has $\rho_0$-measure zero. It is the same for $b(\mathcal{T})$. Hence
the transport rays provide a partition of $\mathcal{T}$ up to a $\mu$-negligible set, and we
disintegrate the measure $\mu$ on $\mathcal{T}$ w.r.t. this partition. Proposition 2.4 says that the conditional
measures have no atoms, hence we get an optimal map for the induced Monge Problem.

From now on, let us consider an optimal transference plan $\Pi$ between $\rho_0$ and $\rho_1$
relatively to the cost $c(x,y) = \|x-y\|_{k,\gamma}$. Denote by $\Pi_n := (\pi_n)_\#\Pi$ its image by the
projection $\pi_n$. We know that $\Pi_n$ is an optimal coupling between $\rho_0^n := (\pi_n)_\#\rho_0$ and
$\rho_1^n := (\pi_n)_\#\rho_1$ for the cost on $V_n$ induced by $c$. This latter cost is a norm over
$V_n = \pi_n(W)$ which inherits strict convexity. So by [9], the Monge Problem in this
setting admits at least one solution, say $T_n$. In other words, if we denote by $\Gamma_n$ the
support of $\Pi_n$ then we have $\Gamma_n = \mathrm{Graph}(T_n)$. First we have two technical propositions.

Proposition 2.1 Assume that there exists $M > 0$ such that the densities of $\rho_0^n$ and $\rho_1^n$
are bounded by $M$ almost everywhere. Then the following estimate holds true for
every compact subset $A \subset W$ :

$$\gamma_n(T_{n,t}(A)) \geq \frac{1}{M}\,\rho_0^n(A) \quad \forall t \in [0,1],$$

where $T_{n,t} := (1-t)Id + tT_n$.

The proof is essentially the same as in [8]. The only difference is to consider Monge maps
for the cost induced by $\|\cdot\|_{k,\gamma}^p$ (with $p > 1$), instead of $|\cdot|^p$. Indeed these costs satisfy
the conditions of Theorem 6.2.7 in [4], so that the optimal maps $T_n^p$ are approximately
differentiable.

Let us come back to the Wiener space. We have the following approximation result :

Proposition 2.2 Assume that there exists $M > 0$ such that for every compact subset
$A \subset W$, the following holds true :

$$\gamma_n\big(T_t(\Gamma_n \cap A \times W)\big) \geq M\rho_0^n(A) \quad \forall n \in \mathbb{N}.$$

Then for every compact subset $A \subset W$, we have :

$$\mu\big(T_t(\Gamma \cap A \times W)\big) \geq M\rho_0(A),$$

where $\Gamma$ is the support of $\Pi$ and $T_t(x,y) := (1-t)x + ty$.

The proof (established again in [8]) uses generalities of measure theory and Hausdorff
topology. It holds for general measures, provided the cost is at least lower
semi-continuous.

We can now head for the solution of the Monge Problem.

The first step is to prove that the set of initial points has $\rho_0$-measure zero and the set of final
points has $\rho_1$-measure zero. We denote for the sequel by $m := f_\#\rho_0$ and $m_\mu := f_\#\mu$
the image measures by $f$ (the cross section).

Proposition 2.3 If $\rho_0$ and $\rho_1$ are absolutely continuous with respect to the Wiener
measure $\mu$ then :

$$\rho_0(a(\mathcal{T})) = 0 \quad \text{and} \quad \rho_1(b(\mathcal{T})) = 0.$$

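Proof :

It is enough to prove that $\rho_0(a(\mathcal{T})) = 0$. Let $f_0$ be the density of $\rho_0$ with respect to $\mu$.
Assume that $\rho_0(a(\mathcal{T})) > 0$. Let $A \subset a(\mathcal{T})$ and $\delta, M > 0$ be such that $\rho_0(A) > 0$ and for
every $x \in A$, $\delta \leq f_0(x) \leq M$. Consider the restriction $\mu|_{\mathcal{T}}$ of $\mu$ to $\mathcal{T}$ and
its disintegration :

$$\mu|_{\mathcal{T}} = \int_S \mu_y\, m_\mu(dy), \qquad \mu_y(\mathcal{T}) = 1 \quad m_\mu\text{-a.e.}$$

Now consider the initial point map $a : S \to A$ and the image measure $a_\# m_\mu$. We
observe that $\rho_0|_A$ is absolutely continuous with respect to $a_\# m_\mu$. Indeed we have :

$$\forall B \subset A, \quad \rho_0(B) > 0 \Rightarrow \mu(R(B) \cap \mathcal{T}) > 0.$$

Hence there exists a subset $\hat A \subset A$ of positive $a_\# m_\mu$-measure such that the map
$h : \hat A \to \mathbb{R}$ defined as :

$$h(x) := \frac{d\rho_0|_{\hat A}}{d\,a_\# m_\mu}(x)$$

satisfies $h(x) \leq M'$ for some positive constant $M'$.
Now let us introduce

$$\hat\mu(\cdot) := \int_{R(\hat A) \cap S} h(a(y))\,\mu_y(\cdot)\,m_\mu(dy).$$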
J R{A)r\S 

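For $\hat S \subset S$ we have :

$$\rho_0|_{\hat A}\Big(\bigcup_{y \in \hat S} R(y)\Big) = \rho_0|_{\hat A}(a(\hat S)) = \int_{a(\hat S)} h(w)\,d(a_\# m_\mu)(w) = \int_{\hat S} h(a(w))\,m_\mu(dw) = \hat\mu\Big(\bigcup_{y \in \hat S} R(y)\Big).$$

This yields that $\mathcal{T}_e$ is still a transport set for the transport problem between $\rho_0|_{\hat A}$ and
$\hat\mu$. Moreover these two measures have uniformly bounded densities. Hence if we project
them, we obtain the finite dimensional estimate of Proposition 2.1. Then we get the
infinite dimensional estimate thanks to Proposition 2.2 : there exists some $C > 0$ such
that for any compact subset $B$ of $W$, we have :

$$\mu\big(T_t(\Gamma \cap B \times W)\big) \geq C\rho_0(B).$$

This is the case for $\hat A$ considered above. Then we can find a sequence $(t_n)_n$ converging
to $0$ such that :

$$\mu\big(T_{t_n}(\Gamma \cap \hat A \times W)\big) \geq C\rho_0(\hat A) \geq \delta C\mu(\hat A)$$

and $\mu\big(\hat A \cap T_{t_n}(\Gamma \cap \hat A \times W)\big) = 0$.
If we denote $\hat A^\varepsilon := \{x \in W;\ \|\hat A - x\|_{k,\gamma} \leq \varepsilon\}$ then for $t_n \leq \varepsilon/M$ we have :

$$\mu(\hat A^\varepsilon) \geq \mu(\hat A) + \mu\big(T_{t_n}(\Gamma \cap \hat A \times W)\big) \geq (1 + C\delta)\mu(\hat A).$$
But $\mu(\hat A) = \lim_{\varepsilon \to 0} \mu(\hat A^\varepsilon)$ and we get a contradiction. The result follows. ■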

Here we state a fundamental point which will allow us to apply the result on the Monge
Problem in one dimension.

Proposition 2.4 If $\rho_0$ and $\rho_1$ are absolutely continuous with respect to the Wiener
measure $\mu$, then for $m$-a.e. $y \in S$, the conditional probabilities $\rho_{0,y}$ and $\rho_{1,y}$ have no
atoms.

Proof :

It is enough to prove it for the first measure. Let $f_0$ and $f_1$ be respectively the densities of $\rho_0$
and $\rho_1$ with respect to $\mu$. Assume that there exists a measurable subset $\hat S$ of $S$ such
that $m(\hat S) > 0$ and for every $y \in \hat S$ there exists $x_y$ such that $\rho_{0,y}(\{x_y\}) > 0$. From
now on we restrict both $\rho_0$ and $\rho_1$ to $R(\hat S)$ and denote them again by $\rho_0$ and $\rho_1$. Let us
consider the sets $K_{i,M} := \{x \in W;\ f_i(x) \leq M\}$ for $i = 0, 1$. For $M$ sufficiently large, the
conditional probabilities of the disintegration of $\rho_0|_{K_{0,M}}$ have atoms.
Define $\rho_{0,y,M} := \rho_{0,y}|_{K_{0,M}}$, $\rho_{1,y,M} := \rho_{1,y}|_{K_{1,M}}$ and

$$D(N) := \left\{ y \in \hat S;\ h(y) := \frac{\rho_{1,y,M}(R(y))}{\rho_{0,y,M}(R(y))} \leq N \right\}.$$

For $N$ sufficiently large, we have $m(D(N)) > 0$. Hence the map $h : D(N) \to \mathbb{R}$ is
well defined and allows us to introduce :

$$\hat\rho_0 := \int_{D(N)} h(y)\,\rho_{0,y,M}\,m(dy), \qquad \hat\rho_1 := \rho_1|_{R(D(N)) \cap K_{1,M}}.$$

These measures have bounded densities with respect to $\mu$ and the set $\hat{\mathcal{T}} := \mathcal{T} \cap G(K_{0,M}) \cap
G^{-1}(K_{1,M})$ is a transport set for the transport problem between $\hat\rho_0$ and $\hat\rho_1$. It follows
that $\hat\mu := \mu|_{\hat{\mathcal{T}}}$ satisfies the conditions of Proposition 2.2.
Suppose $\hat\rho_0(Pr_2(A)) > 0$ where :

$$A := \bigcup_{y \in \hat S} \{(y,x);\ x \in R(y),\ \hat\rho_{0,y}(\{x\}) > 0\}$$

is a Borel set. It is well known that $A$ is a countable union of Borel graphs (Lusin
Theorem). If we take one such graph $\hat A$ we have $m(Pr_1(\hat A)) > 0$, hence by disintegration
$\hat\rho_0(Pr_2(\hat A)) > 0$. Applying Proposition 2.2 we get :

$$\hat\mu\big(T_t(\Gamma \cap Pr_2(\hat A) \times W)\big) \geq M\hat\rho_0(Pr_2(\hat A)).$$

But $T_t(\Gamma \cap Pr_2(\hat A) \times W) \cap Pr_2(\hat A) = \emptyset$, then letting $t \to 0$ we get a contradiction
and it follows that $\hat\rho_0(Pr_2(A)) = 0$.

In particular the conditional probabilities $\hat\rho_{0,y}$ of the disintegration of $\hat\rho_0$ have no
atoms. We get a contradiction since $\hat\rho_{0,y}$ is absolutely continuous with respect to $\rho_{0,y,M}$
and the latter measure has atoms. ■



Theorem 2.2 Let $\rho_0$ and $\rho_1$ be two measures on $W$ satisfying (1.3) and such that
both are absolutely continuous with respect to the Wiener measure $\mu$. Assume $\mathcal{I}(\Pi)$ is
finite for some $\Pi \in \Gamma(\rho_0,\rho_1)$.

Then there exists an optimal coupling between $\rho_0$ and $\rho_1$ which is concentrated on
a graph of some Borel map $T : W \to W$.

Proof :

We can proceed as in Theorem 8.3 in [8], putting $\|\cdot\|_{k,\gamma}$ instead of $|\cdot|_H$. ■



3 Convexity of relative entropy on Wiener space 

This part is split into two sections. The first one recalls the finite dimensional setting and extends known results to the uniform norm and to the norm $\|\cdot\|_{k,\gamma}$ on $\mathbb{R}^d$. The second one uses this result to reach our goal on the Wiener space. Throughout this section, the cost $c$ is a distance induced by a norm, so that it makes sense to consider the Wasserstein distance.

We consider the relative entropy with respect to a reference measure $\gamma$, defined as :

\[
\mathrm{Ent}_\gamma(\rho) :=
\begin{cases}
\displaystyle\int f \log(f)\, d\gamma & \text{if } \rho \text{ admits } f \text{ as density w.r.t. } \gamma, \\
+\infty & \text{otherwise.}
\end{cases}
\]

We need to recall the notion of geodesics on the space of probability measures. A curve $t \in [0,1] \mapsto \rho_t \in \mathcal{P}_2(W)$ is a (constant speed) geodesic provided :

\[ W_{2,c}(\rho_t, \rho_s) = (t-s)\, W_{2,c}(\rho_0, \rho_1) \qquad \forall\, 0 \le s \le t \le 1. \]

One can obtain a constant speed geodesic by letting $\rho_t := ((1-t)\mathrm{Pr}_1 + t\,\mathrm{Pr}_2)_\# \Pi$ for all $t \in [0,1]$, if $\Pi$ is an optimal coupling for $c^2$. In fact each optimal transference plan induces a constant speed geodesic (see for instance [3]). Moreover, on a non branching space, whenever the optimal transference plan is unique, there is a unique constant speed geodesic between $\rho_0$ and $\rho_1$. For example, a Banach space with a strictly convex norm is non branching, while a Banach space with a non strictly convex norm is branching.
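The interpolation $\rho_t = ((1-t)\mathrm{Pr}_1 + t\,\mathrm{Pr}_2)_\# \Pi$ can be checked numerically in the simplest possible setting. The sketch below is only an illustration under strong assumptions: it works on the real line (where every norm is a multiple of the absolute value) with uniform empirical measures having the same number of atoms, for which the sorted matching is an optimal coupling. The helper names `w2_sorted` and `interpolate` are hypothetical and do not come from the paper.

```python
import math

def w2_sorted(xs, ys):
    """W_2 between two uniform empirical measures on the line with the same
    number of atoms: the monotone (sorted) matching is optimal in dimension 1."""
    xs, ys = sorted(xs), sorted(ys)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(xs, ys)) / len(xs))

def interpolate(xs, ys, t):
    """Atoms of rho_t = ((1 - t) Pr_1 + t Pr_2)_# Pi for the sorted coupling Pi."""
    return [(1 - t) * x + t * y for x, y in zip(sorted(xs), sorted(ys))]

rho0 = [0.0, 1.0, 4.0]
rho1 = [2.0, 3.0, 9.0]
d01 = w2_sorted(rho0, rho1)

# Compare W_2(rho_s, rho_t) with (t - s) * W_2(rho_0, rho_1) for a few pairs.
for s, t in [(0.0, 0.25), (0.25, 0.75), (0.1, 1.0)]:
    dst = w2_sorted(interpolate(rho0, rho1, s), interpolate(rho0, rho1, t))
    print(s, t, dst, (t - s) * d01)
```

The two printed distances agree on each line up to rounding, which is the constant speed property in this toy case.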

Thanks to this definition, one can consider the notion of convexity (and weak convexity) along (constant speed) geodesics.

Definition 3.1 For $\rho_0$, $\rho_1$ two probability measures with finite second moments (for a cost $c$), we say that relative entropy with respect to a reference measure $\mu$ is weakly $K$-convex along (constant speed) geodesics on $(\mathcal{P}_2(\cdot), c)$ if there exists a (constant speed) geodesic $\rho_t$ induced by an optimal transference plan belonging to $\Gamma_0^c(\rho_0, \rho_1)$ such that :

\[ \mathrm{Ent}_\mu(\rho_t) \le (1-t)\,\mathrm{Ent}_\mu(\rho_0) + t\,\mathrm{Ent}_\mu(\rho_1) - \frac{K\,t(1-t)}{2}\, W_{2,c}^2(\rho_0,\rho_1) \qquad \forall t \in [0,1]. \]

We say that relative entropy is $K$-convex along (constant speed) geodesics on $(\mathcal{P}_2(\cdot), c)$ (not weakly) if the latter inequality holds for all (constant speed) geodesics $\rho_t$.



3.1 Finite dimensional cases 

We want to show that the relative entropy (with respect to the standard Gaussian measure $\gamma_d$) is $K$-convex along (constant speed) geodesics in the finite dimensional space $\mathbb{R}^d$ endowed with the uniform norm and with the norm $\|\cdot\|_{k,\gamma}$. For the first one, we need to consider the cases where $\mathbb{R}^d$ is endowed with different norms. For $p$ a positive integer we set :

\[ c_p(x,y) := |x-y|_p = \Big( \sum_{i=1}^d |x_i - y_i|^p \Big)^{1/p}. \]

Let us begin by recalling $K$-convexity along geodesics for a suitable norm on $\mathbb{R}^d$.

Proposition 3.1 Let $\|\cdot\|$ be a strictly convex norm on $\mathbb{R}^d$, differentiable on $\mathbb{R}^d \setminus \{0\}$. If $\sqrt{K}\,\|\cdot\| \le |\cdot|_2$ then relative entropy w.r.t. $\gamma_d$ on $(\mathbb{R}^d, \|\cdot\|)$ is $K$-convex along (constant speed) geodesics on $(\mathcal{P}_2(\mathbb{R}^d), W_2)$, where $W_2$ is the Wasserstein distance induced by $\|\cdot\|$.

By assumption on $\|\cdot\|$, here $(\mathbb{R}^d, \|\cdot\|)$ is a non branching space.
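Proof :

Let $\rho_0$ and $\rho_1$ be two probability measures absolutely continuous with respect to $\gamma_d$ (hence with respect to the Lebesgue measure $\mathcal{L}$); otherwise the result is trivial. Let $d\rho_0 = f_0\, d\mathcal{L}$ and $d\rho_1 = f_1\, d\mathcal{L}$; then, for $i = 0, 1$, the density of $\rho_i$ with respect to $\gamma_d$ is $f_i(x)(2\pi)^{d/2} e^{|x|_2^2/2}$. Write :

\begin{align*}
\mathrm{Ent}_{\gamma_d}(\rho_i) &= \int f_i(x)(2\pi)^{d/2} e^{|x|_2^2/2} \log\big( f_i(x)(2\pi)^{d/2} e^{|x|_2^2/2} \big)\, d\gamma_d(x) \\
&= \int f_i(x)(2\pi)^{d/2} e^{|x|_2^2/2} \log(f_i(x))\, d\gamma_d(x) + \int f_i(x)(2\pi)^{d/2} e^{|x|_2^2/2} \log\big((2\pi)^{d/2}\big)\, d\gamma_d(x) \\
&\quad + \int f_i(x)(2\pi)^{d/2} e^{|x|_2^2/2}\, \frac{|x|_2^2}{2}\, d\gamma_d(x) \\
&= \int f_i(x) \log(f_i(x))\, d\mathcal{L}(x) + \int \frac{1}{2}|x|_2^2\, d\rho_i(x) + \frac{d}{2}\log(2\pi).
\end{align*}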



* First term of the latter equality is relative entropy with respect to C. Since is 
strictly convex and differentiable, it suffices to follow f^ to see that it is convex along 
geodesies on 7^2 (R'', ||-||)- 

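* Let us show that the second term of the latter equality, $\rho \mapsto \int \frac{1}{2}|x|_2^2\, d\rho(x)$, is $K$-convex along geodesics on $\mathcal{P}_2(\mathbb{R}^d, \|\cdot\|)$. We know that the map $x \mapsto \frac{1}{2}|x|_2^2$ is 1-convex, i.e. for all $x_1, x_2 \in \mathbb{R}^d$ and $t$ in $[0, 1]$ :

\begin{align*}
\tfrac{1}{2}|(1-t)x_1 + t x_2|_2^2 &\le \tfrac{1-t}{2}|x_1|_2^2 + \tfrac{t}{2}|x_2|_2^2 - \tfrac{t(1-t)}{2}|x_1 - x_2|_2^2 \\
&\le \tfrac{1-t}{2}|x_1|_2^2 + \tfrac{t}{2}|x_2|_2^2 - \tfrac{K\,t(1-t)}{2}\|x_1 - x_2\|^2,
\end{align*}

where the second inequality uses the assumption $\sqrt{K}\,\|\cdot\| \le |\cdot|_2$. Consider an optimal coupling $\Pi$ (for $\|\cdot\|^2$) between $\rho_0$ and $\rho_1$. Integrating the previous inequality w.r.t. $\Pi$, we get :

\[ \int \tfrac{1}{2}|(1-t)x_1 + t x_2|_2^2\, d\Pi(x_1,x_2) \le \tfrac{1-t}{2}\int |x_1|_2^2\, d\rho_0(x_1) + \tfrac{t}{2}\int |x_2|_2^2\, d\rho_1(x_2) - \tfrac{K\,t(1-t)}{2}\, W_2^2(\rho_0,\rho_1), \]

and the left-hand side equals $\int \frac{1}{2}|x|_2^2\, d\rho_t(x)$ for $\rho_t := ((1-t)\mathrm{Pr}_1 + t\,\mathrm{Pr}_2)_\# \Pi$.

* Finally relative entropy with respect to $\gamma_d$ is $K$-convex along (constant speed) geodesics, as the sum of a 0-convex map, a $K$-convex map and a constant. ■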
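The pointwise inequality at the heart of this proof is easy to test numerically. The following sketch is an illustration only, with hypothetical helper names: it samples random points and checks that the Euclidean case is an exact identity, and that replacing $|x_1 - x_2|_2$ by the smaller quantity $|x_1 - x_2|_p$ with $p \ge 2$ (the case $K = 1$) keeps the inequality valid.

```python
import random

def norm_p(v, p):
    """The l^p norm |v|_p of a finite vector."""
    return sum(abs(a) ** p for a in v) ** (1.0 / p)

def half_sq(v):
    """x -> |x|_2^2 / 2."""
    return 0.5 * sum(a * a for a in v)

def convexity_gap(x, y, t, p):
    """Right-hand side minus left-hand side of the K = 1 inequality with |.|_p."""
    z = [(1 - t) * a + t * b for a, b in zip(x, y)]
    diff = [a - b for a, b in zip(x, y)]
    rhs = (1 - t) * half_sq(x) + t * half_sq(y) - 0.5 * t * (1 - t) * norm_p(diff, p) ** 2
    return rhs - half_sq(z)

random.seed(0)
gaps = []
for _ in range(1000):
    x = [random.uniform(-5, 5) for _ in range(4)]
    y = [random.uniform(-5, 5) for _ in range(4)]
    t = random.random()
    gaps.append((convexity_gap(x, y, t, 2), convexity_gap(x, y, t, 3)))

print(max(abs(g2) for g2, _ in gaps))   # Euclidean case: an identity up to rounding
print(min(g3 for _, g3 in gaps))        # p = 3: the gap stays nonnegative
```

For $p = 3$ the gap equals $\frac{t(1-t)}{2}\big(|x_1-x_2|_2^2 - |x_1-x_2|_3^2\big)$, which is nonnegative because $|\cdot|_3 \le |\cdot|_2$.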



We will apply Proposition 3.1 in the following cases :

• For the norm $|\cdot|_p$ with $2 \le p < \infty$ ($K = 1$).

• For the norm $\|\cdot\|_{k,\gamma}$ ($K = 1/C_{k,\gamma}^2$).

We shall extend the result to the uniform norm $|\cdot|_\infty$. This is a priori not obvious, since $x \mapsto |x|_\infty$ is neither strictly convex nor differentiable on $\mathbb{R}^d \setminus \{0\}$.

Now the question is: what happens when $p$ equals $+\infty$? When one changes the cost function, two items change in the above inequality: the Wasserstein distance, but also the (constant speed) geodesics, which depend implicitly on an optimal transference plan, which itself depends on the cost function.

Let $\rho_0$ and $\rho_1$ be fixed once and for all, as probability measures on $\mathbb{R}^d$ with finite second moments (for $|\cdot|_\infty$). In particular, $W_{2,p}(\rho_0,\rho_1) < \infty$ for all $p \ge 2$. We know that for $p \ge 2$, there exists a unique optimal transference plan $\Pi_0^{(p)}$ between $\rho_0$ and $\rho_1$ (for the cost function $c_p^2$). Thus we can study the behavior of the sequence $(\Pi_0^{(p)})_p$. When $p$ varies, the cost function varies too. In fact $(c_p)_p$ converges to the cost $c_\infty$ induced by $|\cdot|_\infty$, and it would be interesting if the sequence of optimal couplings converged to an optimal coupling for the infinite cost. Indeed it turns out that, up to a subsequence, $(\Pi_0^{(p)})_p$ converges weakly to a probability measure which is an optimal transference plan for the infinite cost. This fact, combined with the lower semicontinuity of the relative entropy and with the fact that the sequence

\[ \big( W_{2,p}(\rho_0,\rho_1) \big)_{p \ge 2} \]

is nonincreasing, will yield 1-convexity of relative entropy along geodesics on $(\mathbb{R}^d, |\cdot|_\infty)$.

To prove the weak convergence of $(\Pi_0^{(p)})_p$, a first (easy but useful) remark is that the sequence is tight. By Prokhorov's Theorem, there exists a subsequence $(\Pi_0^{(p_k)})_k$, that we will denote by $(\Pi_0^{(p)})_p$ again, converging weakly to a measure $\Pi^\infty$. It is easy to check that $\Pi^\infty$ is a coupling of $\rho_0$ and $\rho_1$. We wish to show that $\Pi^\infty$ is optimal for the cost $c_\infty^2$, where $c_\infty(x,y) := |x - y|_\infty$. The difficulty lies in the fact that $(c_p)_p$ converges to $c_\infty$ but not uniformly. Nevertheless, the argument works because the convergence is uniform on all compact subsets of $\mathbb{R}^d \times \mathbb{R}^d$.
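The uniform convergence on compact sets follows from the elementary bound $|x|_\infty \le |x|_p \le d^{1/p}|x|_\infty$, which is a standard fact rather than something stated in the text above. The sketch below, with hypothetical helper names, checks this bound on random points of the unit cube and prints how fast the worst-case gap shrinks with $p$.

```python
import random

def norm_p(v, p):
    """The l^p norm |v|_p of a finite vector."""
    return sum(abs(a) ** p for a in v) ** (1.0 / p)

def norm_inf(v):
    """The uniform norm |v|_inf."""
    return max(abs(a) for a in v)

random.seed(1)
d = 5
points = [[random.uniform(-1, 1) for _ in range(d)] for _ in range(200)]

for p in (2, 4, 16, 64):
    # Largest observed gap |v|_p - |v|_inf over the sample ...
    worst = max(norm_p(v, p) - norm_inf(v) for v in points)
    # ... compared with the theoretical bound (d^(1/p) - 1) * sup |v|_inf.
    bound = (d ** (1.0 / p) - 1.0) * max(norm_inf(v) for v in points)
    print(p, worst, bound)
```

Since the bound depends on the points only through $\sup |v|_\infty$, it tends to zero uniformly on any bounded set, but not on the whole space.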

The next Lemma, which is an essential point for this purpose, is taken from [3]. Recall that for a probability measure $\mu$, its support $\mathrm{Supp}(\mu)$ is defined as the smallest closed set on which $\mu$ is concentrated (i.e. $\mathbb{R}^d \setminus \mathrm{Supp}(\mu)$ is $\mu$-negligible).

Lemma 3.1 For all $x \in \mathrm{Supp}(\Pi^\infty)$, there exist $x_p \in \mathrm{Supp}(\Pi_0^{(p)})$ such that $\lim_p x_p = x$.

To keep the paper self-contained, we include the proof here.
Proof :

Let $x \in \mathrm{Supp}(\Pi^\infty) \subset \mathbb{R}^d \times \mathbb{R}^d$. Then $\Pi^\infty$ gives positive mass to every open ball centered at $x$. So let $k \in \mathbb{N}^*$; we have $\Pi^\infty(B(x, 1/k)) > 0$, where $B(x, 1/k)$ is the open ball centered at $x$ with radius $1/k$. Thanks to the weak convergence, we have :

\[ \liminf_{p \to +\infty} \Pi_0^{(p)}(B(x, 1/k)) \ge \Pi^\infty(B(x, 1/k)) > 0. \]

This inequality lets us define an increasing sequence $(j_k)_k$ such that $j_0 := 0$ and, for $k > 0$,

\[ j_k := \min\{p \in \mathbb{N},\ p > j_{k-1},\ \forall n \ge p :\ \mathrm{Supp}(\Pi_0^{(n)}) \cap B(x, 1/k) \neq \emptyset\}. \]

Since the sequence is increasing, for all $k \in \mathbb{N}^*$ and all $j_k \le p < j_{k+1}$ we can pick a point $x_p \in \mathrm{Supp}(\Pi_0^{(p)}) \cap B(x, 1/k)$ (for $p < j_1$, any point of $\mathrm{Supp}(\Pi_0^{(p)})$ will do). The sequence $(x_p)_p$ converges to $x$ and for all $p \in \mathbb{N}$, we have $x_p \in \mathrm{Supp}(\Pi_0^{(p)})$. ■

Remark: In fact this proof (hence this lemma) is valid for any sequence of measures converging weakly to another measure, on any metric space.

Proposition 3.2 $\Pi^\infty$ is optimal for the cost $c_\infty^2$, where $c_\infty(x,y) = |x-y|_\infty$.

Here we use the equivalence between optimality for $c_\infty^2$ and $c_\infty^2$-cyclical monotonicity (see [20] for continuous and real valued costs $c$).
Proof : 

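It is sufficient to prove that $\mathrm{Supp}(\Pi^\infty)$ is $c_\infty^2$-cyclically monotone. Let $N \in \mathbb{N}^*$ and $(x_1,y_1), \ldots, (x_N,y_N) \in \mathrm{Supp}(\Pi^\infty)$. Since $(\Pi_0^{(p)})_p$ converges weakly to $\Pi^\infty$, we can apply Lemma 3.1: for all $i = 1, \ldots, N$, there exists $(x_i^p, y_i^p) \in \mathrm{Supp}(\Pi_0^{(p)})$ such that $\lim_p (x_i^p, y_i^p) = (x_i, y_i)$. Thus $(x_1^p, y_1^p), \ldots, (x_N^p, y_N^p) \in \mathrm{Supp}(\Pi_0^{(p)})$, which is $c_p^2$-cyclically monotone since $\Pi_0^{(p)}$ is optimal for the cost $c_p^2$. Then the inequality

\[ \sum_{i=1}^N c_p^2(x_i^p, y_i^p) \le \sum_{i=1}^N c_p^2(x_i^p, y_{i+1}^p) \tag{3.9} \]

holds, with $y_{N+1}^p := y_1^p$. It is elementary to check that the sets

\[ \bigcup_{p \ge 2} \{(x_1^p, y_1^p), \ldots, (x_N^p, y_N^p)\} \cup \{(x_1, y_1), \ldots, (x_N, y_N)\}, \]

\[ \bigcup_{p \ge 2} \{(x_1^p, y_2^p), \ldots, (x_N^p, y_1^p)\} \cup \{(x_1, y_2), \ldots, (x_N, y_1)\}, \]

are compact subsets of $\mathbb{R}^d \times \mathbb{R}^d$. Since $(c_p)_p$ converges to $c_\infty$ uniformly on compact subsets of $\mathbb{R}^d \times \mathbb{R}^d$, we get from (3.9), taking the limit $p \to +\infty$ :

\[ \sum_{i=1}^N c_\infty^2(x_i, y_i) \le \sum_{i=1}^N c_\infty^2(x_i, y_{i+1}). \]

That is exactly the definition of $c_\infty^2$-cyclical monotonicity for $\mathrm{Supp}(\Pi^\infty)$.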
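For a finite family of pairs, cyclical monotonicity can be tested by brute force. The sketch below is an illustration with hypothetical names, not the paper's method: it checks every permutation of the targets, which for a finite family is equivalent to the cyclic condition because every permutation decomposes into cycles. The cost is the squared uniform norm used above.

```python
from itertools import permutations

def cost_inf_sq(x, y):
    """c(x, y)^2 for the uniform norm c(x, y) = |x - y|_inf."""
    return max(abs(a - b) for a, b in zip(x, y)) ** 2

def is_cyclically_monotone(pairs, cost, tol=1e-12):
    """Brute-force check: no permutation of the targets lowers the total cost."""
    xs = [x for x, _ in pairs]
    ys = [y for _, y in pairs]
    base = sum(cost(x, y) for x, y in zip(xs, ys))
    return all(
        base <= sum(cost(x, ys[j]) for x, j in zip(xs, sigma)) + tol
        for sigma in permutations(range(len(ys)))
    )

# Every point is translated by the same vector (1, 0): this family is monotone.
shifted = [((0.0, 0.0), (1.0, 0.0)), ((2.0, 1.0), (3.0, 1.0)), ((5.0, 2.0), (6.0, 2.0))]
# The two targets are swapped: exchanging them back lowers the cost from 10 to 2.
crossed = [((0.0, 0.0), (3.0, 1.0)), ((2.0, 1.0), (1.0, 0.0))]

print(is_cyclically_monotone(shifted, cost_inf_sq))   # True
print(is_cyclically_monotone(crossed, cost_inf_sq))   # False
```

The check is factorial in the number of pairs, so it is only meant for small hand-made examples.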



Because $|\cdot|_\infty$ is not strictly convex, $(\mathbb{R}^d, |\cdot|_\infty)$ is a branching space: there exist many constant speed geodesics between two probability measures. Finally one can conclude this section with the following result :

Proposition 3.3 Relative entropy w.r.t. $\gamma_d$ on $(\mathbb{R}^d, |\cdot|_\infty)$ is weakly 1-convex along (constant speed) geodesics on $(\mathcal{P}_2(\mathbb{R}^d), W_{2,\infty})$.
Proof :

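For $p \in [2, +\infty)$ we consider optimal transference plans $\Pi_0^{(p)} \in \Gamma_0^p(\rho_0, \rho_1)$ and the associated constant speed geodesics $\rho_t^{(p)}$. Applying Proposition 3.1 with the $|\cdot|_p$ norms, we get :

\[ \mathrm{Ent}_{\gamma_d}(\rho_t^{(p)}) \le (1-t)\,\mathrm{Ent}_{\gamma_d}(\rho_0) + t\,\mathrm{Ent}_{\gamma_d}(\rho_1) - \frac{t(1-t)}{2}\, W_{2,p}^2(\rho_0, \rho_1), \tag{3.10} \]

for all $p \ge 2$. But for $x \in \mathbb{R}^d$ and $p \ge 2$, $|x|_p \ge |x|_\infty$. Then :

\begin{align*}
\int |x-y|_p^2\, d\Pi_0^{(p)}(x,y) &\ge \int |x-y|_\infty^2\, d\Pi_0^{(p)}(x,y), \\
W_{2,p}^2(\rho_0,\rho_1) &\ge \int |x-y|_\infty^2\, d\Pi_0^{(p)}(x,y) \ge W_{2,\infty}^2(\rho_0,\rho_1), \\
\Longrightarrow\ \liminf_p W_{2,p}^2(\rho_0,\rho_1) &\ge W_{2,\infty}^2(\rho_0,\rho_1).
\end{align*}

Moreover, the sequence $(\Pi_0^{(p)})_p$ converges weakly to $\Pi^\infty$ and this coupling is optimal for $|\cdot|_\infty^2$ thanks to the previous proposition. Hence the sequence $(\rho_t^{(p)})_p$ converges weakly to $\rho_t^\infty := ((1-t)\mathrm{Pr}_1 + t\,\mathrm{Pr}_2)_\# \Pi^\infty$ for all $t \in [0, 1]$. But since the relative entropy is lower semi-continuous, we have :

\[ \mathrm{Ent}_{\gamma_d}(\rho_t^\infty) \le \liminf_p \mathrm{Ent}_{\gamma_d}(\rho_t^{(p)}). \]

Finally, combining these two arguments and taking the liminf in inequality (3.10) with respect to $p$ :

\[ \mathrm{Ent}_{\gamma_d}(\rho_t^\infty) \le (1-t)\,\mathrm{Ent}_{\gamma_d}(\rho_0) + t\,\mathrm{Ent}_{\gamma_d}(\rho_1) - \frac{t(1-t)}{2}\, W_{2,\infty}^2(\rho_0, \rho_1). \]

■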
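The comparison $W_{2,p} \ge W_{2,\infty}$ used in this proof can be observed on a toy example. The sketch below is an illustration under explicit assumptions: both measures are uniform on four atoms of the plane, so an optimal coupling can be taken to be a permutation (Birkhoff-von Neumann), and the infimum is computed by brute force. The function names and sample points are hypothetical.

```python
from itertools import permutations

def norm_p(v, p):
    """The l^p norm, with p = float('inf') giving the uniform norm."""
    if p == float("inf"):
        return max(abs(a) for a in v)
    return sum(abs(a) ** p for a in v) ** (1.0 / p)

def w2_squared(xs, ys, p):
    """W_{2,p}^2 between uniform measures on xs and ys (same number of atoms),
    by brute force over permutations; fine for a handful of points only."""
    n = len(xs)
    return min(
        sum(norm_p([a - b for a, b in zip(xs[i], ys[s[i]])], p) ** 2 for i in range(n)) / n
        for s in permutations(range(n))
    )

xs = [(0.0, 0.0), (1.0, 2.0), (3.0, 1.0), (4.0, 4.0)]
ys = [(1.0, 1.0), (2.0, 4.0), (5.0, 0.0), (3.0, 3.0)]

# Squared distances for increasing p, ending with the uniform norm.
values = [w2_squared(xs, ys, p) for p in (2, 4, 16, 64, float("inf"))]
print(values)
```

The printed values are nonincreasing in $p$ and approach the value for the uniform norm, in line with the liminf inequality above.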



3.2 Wiener space case 

Consider the same notation as in the previous sections and denote by $\mathcal{A}_n$ the sub-$\sigma$-field on $W$ generated by $\pi_n$. Let us recall the following results :

1. $(\pi_n)_\# \mu = \gamma_n$, the standard Gaussian measure on $V_n$.

2. For $i = 0, 1$, $\rho_i^n := (\pi_n)_\# \rho_i$ is absolutely continuous with respect to $\gamma_n$ with density $f_i^n$, where $f_i^n \circ \pi_n = \mathbb{E}[f_i \mid \mathcal{A}_n] =: \tilde f_i^n$.

3. For $i = 0, 1$, $\tilde f_i^n \to f_i$ in $L^1(W, \mu)$.

Remark: In the following proof, we need to apply Proposition 3.3, which deals with the infinite norm in $\mathbb{R}^n$. It turns out that the above approximation by $(\pi_n)_n$ does not lead us to the infinite norm itself: we only get the infinite norm modified by an invertible matrix. But the previous section can be applied to this modified norm, so that Proposition 3.3 also holds in our case.

Theorem 3.1 Relative entropy w.r.t. $\mu$ on $(W, |\cdot|_\infty)$ is weakly 1-convex along (constant speed) geodesics on $(\mathcal{P}_2(W), W_{2,\infty})$.

Proof : 

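Step 1. Since $\tilde f_i^n = f_i^n \circ \pi_n \to f_i$ in $L^1(W, \mu)$, we have :

\[ \tilde\rho_i^n := (f_i^n \circ \pi_n)\, \mu \longrightarrow f_i\, \mu = \rho_i \quad \text{weakly}, \qquad i = 0, 1. \]

Then, in [3], it is proved that for $\tilde\Pi^n \in \Gamma_0^\infty(\tilde\rho_0^n, \tilde\rho_1^n)$ there exists a subsequence of $(\tilde\Pi^n)_n$, denoted by $(\tilde\Pi^{n_k})_k$, which converges weakly to some $\Pi^\infty$, and we know that this limit point belongs to $\Gamma_0^\infty(\rho_0, \rho_1)$.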



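Step 2. $W_{2,\infty}(\tilde\rho_0^n, \tilde\rho_1^n) \le W_{2,\infty}(\rho_0^n, \rho_1^n)$.

We have the decomposition $W = V_n \oplus V_n^\perp$. Define a probability measure $\Pi \in \Gamma(\tilde\rho_0^n, \tilde\rho_1^n)$ by :

\[ \int_{W \times W} \varphi(x,y)\, d\Pi(x,y) := \int_{V_n^\perp} \int_{V_n \times V_n} \varphi(x_1 + z, x_2 + z)\, d\Pi_n(x_1, x_2)\, d\mu_n^\perp(z), \]

for $\varphi$ a bounded continuous function and $\Pi_n \in \Gamma_0^\infty(\rho_0^n, \rho_1^n)$, where $\mu_n^\perp$ denotes the image of $\mu$ under the projection onto $V_n^\perp$. So :

\begin{align*}
\int_{W \times W} |x-y|_\infty^2\, d\Pi(x,y) &= \int_{V_n \times V_n} |x_1 - x_2|_\infty^2\, d\Pi_n(x_1, x_2) \\
&= W_{2,\infty}^2(\rho_0^n, \rho_1^n),
\end{align*}

\[ \Longrightarrow\ W_{2,\infty}^2(\tilde\rho_0^n, \tilde\rho_1^n) \le W_{2,\infty}^2(\rho_0^n, \rho_1^n). \]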

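Step 3.

Now we have $\Pi_{n_k} := (\pi_{n_k}, \pi_{n_k})_\# \tilde\Pi^{n_k} \in \Gamma_0^\infty(\rho_0^{n_k}, \rho_1^{n_k})$, and if we consider the associated constant speed geodesic $\rho_t^{n_k}$, we can apply Proposition 3.3 to obtain :

\[ \mathrm{Ent}_{\gamma_{n_k}}(\rho_t^{n_k}) \le (1-t)\,\mathrm{Ent}_{\gamma_{n_k}}(\rho_0^{n_k}) + t\,\mathrm{Ent}_{\gamma_{n_k}}(\rho_1^{n_k}) - \frac{t(1-t)}{2}\, W_{2,\infty}^2(\rho_0^{n_k}, \rho_1^{n_k}) \qquad \forall t \in [0,1]. \tag{3.11} \]

Let $t \in [0,1] \mapsto \tilde\rho_t^{n_k} := ((1-t)\mathrm{Pr}_1 + t\,\mathrm{Pr}_2)_\# \tilde\Pi^{n_k}$ be the constant speed geodesic associated to $\tilde\Pi^{n_k}$. We have :

\begin{align*}
\mathrm{Ent}_{\gamma_{n_k}}(\rho_0^{n_k}) &= \int_{V_{n_k}} f_0^{n_k}(x) \log(f_0^{n_k}(x))\, d\gamma_{n_k}(x) \\
&= \int_W f_0^{n_k}(\pi_{n_k}(y)) \log(f_0^{n_k} \circ \pi_{n_k}(y))\, d\mu(y) \\
&= \int_W \tilde f_0^{n_k}(y) \log(\tilde f_0^{n_k}(y))\, d\mu(y) = \mathrm{Ent}_\mu(\tilde\rho_0^{n_k}).
\end{align*}

In the same way, we can easily prove this for $\rho_1^{n_k}$ and $\rho_t^{n_k}$ where $t \in (0, 1)$. Thus (3.11) and Step 2 yield :

\[ \mathrm{Ent}_\mu(\tilde\rho_t^{n_k}) \le (1-t)\,\mathrm{Ent}_\mu(\tilde\rho_0^{n_k}) + t\,\mathrm{Ent}_\mu(\tilde\rho_1^{n_k}) - \frac{t(1-t)}{2}\, W_{2,\infty}^2(\tilde\rho_0^{n_k}, \tilde\rho_1^{n_k}) \qquad \forall t \in [0,1]. \tag{3.12} \]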

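Step 4.

Thanks to Step 1, we have :

\begin{align*}
W_{2,\infty}^2(\rho_0, \rho_1) &= \int_{W \times W} |x-y|_\infty^2\, d\Pi^\infty(x,y) \\
&\le \liminf_k \int_{W \times W} |x-y|_\infty^2\, d\tilde\Pi^{n_k}(x,y) = \liminf_k W_{2,\infty}^2(\tilde\rho_0^{n_k}, \tilde\rho_1^{n_k}).
\end{align*}

Hence for $\varepsilon > 0$, there exists $N \in \mathbb{N}$ such that :

\[ W_{2,\infty}^2(\tilde\rho_0^{n_k}, \tilde\rho_1^{n_k}) + \varepsilon \ge W_{2,\infty}^2(\rho_0, \rho_1) \qquad \forall k \ge N. \]

Now fix $k \ge N$. By Jensen's inequality, we have for $i = 0, 1$ :

\begin{align*}
\mathrm{Ent}_\mu(\tilde\rho_i^{n_k}) &= \int_W \tilde f_i^{n_k}(y) \log(\tilde f_i^{n_k}(y))\, d\mu(y) \\
&= \int_W \mathbb{E}[f_i \mid \mathcal{A}_{n_k}](y) \log\big(\mathbb{E}[f_i \mid \mathcal{A}_{n_k}](y)\big)\, d\mu(y) \\
&\le \int_W \mathbb{E}[f_i \log(f_i) \mid \mathcal{A}_{n_k}](y)\, d\mu(y) \\
&= \int_W f_i(y) \log(f_i(y))\, d\mu(y) = \mathrm{Ent}_\mu(\rho_i),
\end{align*}

and then

\begin{align*}
\mathrm{Ent}_\mu(\tilde\rho_t^{n_k}) &\le (1-t)\,\mathrm{Ent}_\mu(\tilde\rho_0^{n_k}) + t\,\mathrm{Ent}_\mu(\tilde\rho_1^{n_k}) - \frac{t(1-t)}{2}\big( W_{2,\infty}^2(\rho_0, \rho_1) - \varepsilon \big) \\
&\le (1-t)\,\mathrm{Ent}_\mu(\rho_0) + t\,\mathrm{Ent}_\mu(\rho_1) - \frac{t(1-t)}{2}\big( W_{2,\infty}^2(\rho_0, \rho_1) - \varepsilon \big).
\end{align*}

Moreover, since $\tilde\Pi^{n_k}$ converges weakly to $\Pi^\infty$, which is optimal due to Lemma 3.1, we have $\tilde\rho_t^{n_k} \to \rho_t := ((1-t)\mathrm{Pr}_1 + t\,\mathrm{Pr}_2)_\# \Pi^\infty$ weakly.

Denote by $R$ the right hand side of the previous inequality. By compactness of $\{\nu \in \mathcal{P}_2(W);\ \mathrm{Ent}_\mu(\nu) \le R\}$ with respect to the weak topology, we have :

\[ \mathrm{Ent}_\mu(\rho_t) \le (1-t)\,\mathrm{Ent}_\mu(\rho_0) + t\,\mathrm{Ent}_\mu(\rho_1) - \frac{t(1-t)}{2}\big( W_{2,\infty}^2(\rho_0, \rho_1) - \varepsilon \big). \]

Finally letting $\varepsilon \to 0$ we get :

\[ \mathrm{Ent}_\mu(\rho_t) \le (1-t)\,\mathrm{Ent}_\mu(\rho_0) + t\,\mathrm{Ent}_\mu(\rho_1) - \frac{t(1-t)}{2}\, W_{2,\infty}^2(\rho_0, \rho_1) \qquad \forall t \in [0, 1]. \]

■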
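The Jensen step of Step 4, namely that replacing a density by its conditional expectation cannot increase the relative entropy, can be checked on a finite probability space. The sketch below is a toy illustration with hypothetical names and data: the $\sigma$-field is generated by a partition of four atoms, so conditional expectation is a weighted average over each block.

```python
import math

def entropy(density, weights):
    """Ent(rho) = sum_i f_i log(f_i) m_i for a density f w.r.t. the weights m."""
    return sum(f * math.log(f) * m for f, m in zip(density, weights) if f > 0)

def conditional_expectation(density, weights, blocks):
    """E[f | A] where the sigma-field A is generated by the partition `blocks`."""
    out = [0.0] * len(density)
    for block in blocks:
        mass = sum(weights[i] for i in block)
        avg = sum(density[i] * weights[i] for i in block) / mass
        for i in block:
            out[i] = avg
    return out

weights = [0.1, 0.2, 0.3, 0.4]          # reference probability measure
density = [2.0, 0.5, 1.5, 0.625]        # integrates to 1 against the weights
blocks = [[0, 2], [1, 3]]               # partition generating the coarse sigma-field

coarse = conditional_expectation(density, weights, blocks)
print(entropy(coarse, weights), entropy(density, weights))
```

The first printed value is the entropy of the projected measure and the second that of the original one; the first does not exceed the second, as Jensen's inequality predicts for the convex function $x \mapsto x \log x$.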



The same proof holds for the cost $\|\cdot\|_{k,\gamma}$ by applying in Step 3 Proposition 3.1, with $\|\cdot\|_{k,\gamma} \le C_{k,\gamma}|\cdot|_2$, instead of Proposition 3.3. Even better, since the latter norm is strictly convex, $(W, \|\cdot\|_{k,\gamma})$ is a non branching space. Hence we have the Theorem :

Theorem 3.2 Relative entropy w.r.t. $\mu$ on $(W, \|\cdot\|_{k,\gamma})$ is $C_{k,\gamma}^{-2}$-convex along (constant speed) geodesics on $(\mathcal{P}_2(W), W_{2,(k,\gamma)})$.

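• In Step 2 of the first proof we have in fact the equality $W_{2,\infty}(\tilde\rho_0^n, \tilde\rho_1^n) = W_{2,\infty}(\rho_0^n, \rho_1^n)$. This follows from the fact that $|\pi_n(w)|_\infty \le |w|_\infty$ for all $w \in W$. Indeed: we know that $\tilde\Pi^n$ is optimal. Define $\Pi_n := (\pi_n, \pi_n)_\# \tilde\Pi^n \in \mathcal{P}(V_n \times V_n)$. Since the norm on $V_n$ is less than the norm on $W$, we have :

\begin{align*}
\int_{V_n \times V_n} |x - y|_\infty^2\, d\Pi_n(x,y) &= \int_{W \times W} |\pi_n(x) - \pi_n(y)|_\infty^2\, d\tilde\Pi^n(x,y) \\
&\le \int_{W \times W} |x - y|_\infty^2\, d\tilde\Pi^n(x,y) = W_{2,\infty}^2(\tilde\rho_0^n, \tilde\rho_1^n).
\end{align*}

Since $\Pi_n$ is a coupling of $\rho_0^n$ and $\rho_1^n$, the left-hand side is at least $W_{2,\infty}^2(\rho_0^n, \rho_1^n)$, which gives the inequality reverse to that of Step 2.

• For the second norm, it is still not clear whether $\|\pi_n(w)\|_{k,\gamma} \le \|w\|_{k,\gamma}$ for any $w \in W$.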



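Appendices

Theorem .3 (Rademacher's Theorem) If $\varphi \in L^p(W)$ is a $\|\cdot\|_{k,\gamma}$-convex map then $\varphi$ belongs to $\mathbb{D}_1^p(W)$.

Proof :

Our discussion above gives us, for $\mu$-almost all $x$ and $y$ in $W$ :

\[ |\varphi(x) - \varphi(y)| \le c(x,y) = \|x - y\|_{k,\gamma} \le 2|x - y|_H. \]

This can be rewritten, for all $w \in W$ and all $h \in H$, as :

\[ |\varphi(w + h) - \varphi(w)| \le 2|h|_H. \]

Fix $l \in W^* \subset H$. Consider the set

\[ A(l) := \Big\{ w \in W;\ G(w,l) := \lim_{\varepsilon \to 0} \frac{\varphi(w + \varepsilon l) - \varphi(w)}{\varepsilon} \text{ exists} \Big\} \]

and an orthonormal basis $(y_n)_n$ of $Y$, where $Y$ is such that $W = Y \oplus \mathrm{Span}(l)$. By density one can write :



And the measure of $\{t \in \mathbb{R};\ G(y_n + tl, l) \text{ does not exist}\}$ is equal to zero, thanks to Rademacher's Theorem on $\mathbb{R}$. Then $\mu(A(l)) = 1$ and from now on we extend $G(\cdot, l)$ to $W$ by setting $G(w, l) = 0$ for all $w \notin A(l)$. By assumption we have for all $\varepsilon > 0$ and $w \in W$ :

\[ \frac{|\varphi(w + \varepsilon l) - \varphi(w)|}{\varepsilon} \le 2\, \frac{|\varepsilon l|_H}{\varepsilon} = 2|l|_H. \]

This implies that :

\[ |G(w, l)| \le 2|l|_H \qquad \forall w \in W, \text{ and } l \in W^*. \]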

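Now consider :

\[ A_n := \Big\{ \sum_{k=1}^n a_k e_k;\ a_k \in \mathbb{Q} \Big\} \quad \text{and} \quad A := \bigcup_{n \ge 1} A_n. \]

If $l = \sum_{k=1}^n a_k e_k \in A$ then we have $\mu$-almost surely :

\[ G(w, l) = \sum_{k=1}^n (l, e_k)_H\, G(w, e_k). \]

Therefore, for $l \in A$, there exist $n \in \mathbb{N}$ such that $l$ belongs to $A_n$ and a Borel subset $B_n$ of $W$ with full $\mu$-measure, such that we have :

\[ G(w, l) = \sum_{k=1}^n (l, e_k)_H\, G(w, e_k) \qquad \forall w \in B_n. \]

Since $A$ is countable, take $B := \bigcap_n B_n$, which has full $\mu$-measure and satisfies :

\[ G(w, l) = \sum_{k=1}^\infty (l, e_k)_H\, G(w, e_k) \qquad \forall w \in B,\ \forall l \in A. \]

Now define the map $G(w) := \sum_{k \ge 1} G(w, e_k)\, e_k$ for $w \in B$, and $G(w) := 0$ if $w \notin B$. Notice that :

\[ (G(w), l)_H = \sum_{k \ge 1} G(w, e_k)(e_k, l)_H = G(w, l) \qquad \forall w \in B,\ \forall l \in A. \]

Then we have for $w \in B$ :

\[ |G(w)|_H^2 = \Big( G(w), \sum_{k \ge 1} G(w, e_k)\, e_k \Big)_H = G\Big( w, \sum_{k \ge 1} G(w, e_k)\, e_k \Big), \]

and the latter term being less than $2|G(w)|_H$, we get an upper bound on $|G(w)|_H$ :

\[ |G(w)|_H \le 2 \qquad \forall w \in B. \tag{.13} \]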

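At last we have, for $h \in H$ :

\[ \int_W \left| \frac{\varphi(w + \varepsilon h) - \varphi(w)}{\varepsilon} - (G(w), h)_H \right|^p d\mu(w) \le \int_{A(h)} \left| \frac{\varphi(w + \varepsilon h) - \varphi(w)}{\varepsilon} - (G(w), h)_H \right|^p d\mu(w). \]

But thanks to (.13) and the Cauchy-Schwarz inequality, we have

\[ \left| \frac{\varphi(w + \varepsilon h) - \varphi(w)}{\varepsilon} - (G(w), h)_H \right|^p \le (4|h|_H)^p \]

and this latter function belongs to $L^1(W)$. So applying the dominated convergence theorem, we get :

\[ \lim_{\varepsilon \to 0} \int_W \left| \frac{\varphi(w + \varepsilon h) - \varphi(w)}{\varepsilon} - (G(w), h)_H \right|^p d\mu(w) = 0. \]

This shows that $D_h \varphi(w) = (G(w), h)_H$ exists $\mu$-almost surely in $w$. Moreover $G(w)$ is the gradient of $\varphi$ at $w$. Finally $G$ belongs to $L^p(W, H)$, i.e. $\varphi \in \mathbb{D}_1^p(W)$. ■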



Definition .2 We say that a function $f : W \to \mathbb{R}$ is locally $\|\cdot\|_{k,\gamma}$-Lipschitz if for all $R > 0$, there exists $C_R > 0$ such that :

\[ |f(w) - f(w')| \le C_R \|w - w'\|_{k,\gamma} \qquad \forall w, w' \in B_{k,\gamma}(R) := \{w \in W;\ \|w\|_{k,\gamma} < R\}. \]

Proposition .4 Let $(f_i)_{i \in I}$ be a uniform family of locally $\|\cdot\|_{k,\gamma}$-Lipschitz real-valued functions defined on an open subset $U$ of $W$. If the function

\[ f(x) := \sup_{i \in I} f_i(x) \]

is finite everywhere then $f : U \to \mathbb{R}$ is also locally $\|\cdot\|_{k,\gamma}$-Lipschitz.
Proof :

Let $R > 0$ be such that $B_{k,\gamma}(R) \subset U$. There exists $C_R > 0$ (not depending on $i$) such that for all $w, w' \in B_{k,\gamma}(R)$ :

\[ |f_i(w) - f_i(w')| \le C_R \|w - w'\|_{k,\gamma} \qquad \forall i \in I. \]

For each $w \in B_{k,\gamma}(R)$, by definition of $f$ there is a sequence $(i_n)_n$ (depending on $w$) such that $\lim_n f_{i_n}(w) = f(w)$. Moreover $f_{i_n}(w') \le f(w')$ for all $w' \in B_{k,\gamma}(R)$, so :

\[ f_{i_n}(w) - f(w') \le f_{i_n}(w) - f_{i_n}(w') \le C_R \|w - w'\|_{k,\gamma}. \]

Passing to the limit in the previous inequality, we get :

\[ f(w) - f(w') \le C_R \|w - w'\|_{k,\gamma}. \]

Exchanging $w$ and $w'$, this yields that $f$ is locally $\|\cdot\|_{k,\gamma}$-Lipschitz. ■
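The mechanism of this proof, a supremum of functions sharing one Lipschitz constant keeps that constant, can be seen on a small example. The sketch below is only an illustration on the real line with a finite family of affine functions (so the supremum is a maximum); all names and parameters are hypothetical.

```python
import random

def make_family(slopes_and_offsets):
    """Affine functions f_i(x) = a_i * x + b_i on the real line; each is |a_i|-Lipschitz."""
    return [lambda x, a=a, b=b: a * x + b for a, b in slopes_and_offsets]

params = [(-2.0, 1.0), (0.5, -1.0), (1.5, 0.0), (-0.25, 3.0)]
family = make_family(params)
C = max(abs(a) for a, _ in params)       # common Lipschitz constant of the family

def f_sup(x):
    """Pointwise supremum of the family (a maximum here, since the family is finite)."""
    return max(f(x) for f in family)

random.seed(2)
pairs = [(random.uniform(-10, 10), random.uniform(-10, 10)) for _ in range(1000)]

# Largest observed difference quotient of the supremum; pairs that are too
# close together are skipped to avoid amplifying rounding errors.
worst_ratio = max(
    abs(f_sup(x) - f_sup(y)) / abs(x - y) for x, y in pairs if abs(x - y) > 1e-3
)
print(worst_ratio, C)
```

The observed ratio never exceeds the common constant `C`, in agreement with the inequality obtained by passing to the limit in the proof.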

Acknowledgements. I would like to thank Professor Shizan Fang for his suggestions, and also Nicolas Juillet for useful discussions about general facts of optimal transportation.